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Alexa  makes  the  Web  history 


By  Alex  Lis 


July  9.  1997.  1;05  p.m.  PT 

Confused  or  swamped  Web  surfers  will  soon  have  yet  another 
tool  at  their  disposal  as  the  .Alexa  Internet  Web  navigation  service 
goes  into  beta  next  week. 

The  new  sep.-ice  is  run  by  the  creators  of  the  Fncemet  .Archive,  a 
project  that  a:ms  to  document  and  store  Web  pages.  Usenet 
records,  and  shareware  for  the  historical  record. 

The  .Aiexa  client  is  a  toolbax-  that  sits  at  the  bottom  of  a  browser  as 
ti-.e  user  sons  the  Web.  Once  downloaded  and  installed,  the 
.Alexa  client  will  use  a  bit  to  analyze  each  Web  site  visited.  The 
information  it  gathers  will  then  be  displayed  on  the  toolbar.  .AJexa 
will  relate  information  on  the  site's  traffic,  where  the  site  is 
physically  located,  the  number  of  pages  it  contains,  and  when  the 
site  was  last  updated,  according  to  a  company  spokesperson. 

.Alexa  will  also  display  relevant  li.nks  to  other  sites  by  analyzing 
the  paths  oi  previous  visitors.  .-Mexa  will  add  the  URLs  it 
analyzes  to  the  Internet  .Arcaive. 

Tne  sofr^-are  will  be  available  to  download  for  free  from  the 
•Alexa  Web  site.  The  company  would  not  comment  on  how  it 
plans  to  bring  in  revenue. 
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Startup  Seeks  To  Synthesize  Web 

By  John  Gartner 

S.-VN  FR.AJ^'CISCO  --  Stanup  Alexa  Internet  announced  a  product  Thursday  with  lofty 
aspirations:  to  re-engineer  the  wav  people  browse  the  Web.  The  Web-veteran  founders  of 
A!e\a.  located  in  the  recently  demilitarized  Presidio  compound  in  San  Francisco,  have 
unwrapped  a  browser  companion  application  that  focuses  on  the  user  experience  instead  of  on 
text  searching  to  guide  people  through  the  Internet. 

The  Alexa  toolbar  client  provides  users  with  detailed  information  about  where  they  are  and 
where  they  might  want  to  go  next.  .Alexa  will  distribute  the  client  for  free  and  will  receive 
income  by  broadcasting  ads  based  on  the  content  currently  being  viewed. 

For  each  site  visited,  the  .\lexa  app  displays  the  parent  company  and  provides  ratings  for  the 
depth  and  quality  of  information  based  on  the  amount  of  information  and  how  long  previous 
visitors  remained  at  the  site. 

Tne  'Where  To  Go  "  information  suggests  comparable  sites  based  on  where  users  most 
frequently  branched  off  to  next  and  provides  a  related  advemsement.  For  example,  if  a  user 
visits  a  Ford  Mustang  enthusiast  page.  Alexa  might  display  a  car  advertisement  and  related 
sites  on  auto  repair. 

Alexa  president  and  CEO  Brewster  Kahle  is  no  stranger  to  grandiose  designs  for  changing  the 
way  people  access  information.  Kahle  invented  the  Wide  .Area  Information  Server  (WAIS) 
information  retneval  system,  which  he  later  sold  to  .America  Online.  He  also  worked  as  lead 
engineer  on  the  Thinking  Machines  parallel-processing  supercomputer  project. 

Pn\  atelv  funded.  .Alexa  was  founded  in  .Apnl  1996  bv  Kahle  and  fellow  W.AIS-veteran  Bruce 
Giihat. 

.Alexa  s  client/server  architecture  "sniffs  "  each  user's  trail  around  the  Web  and  sends  a  copy  of 
that  path  back  to  .Alexa  ser.ers.  This  does  not  threaten  individual's  pnvacy.  Kahle  said, 
because  only  the  pathway  is  recorded  and  not  who's  browsing. 

Thjs  extra  data  requires  less  than  10  percent  of  a  user's  bandwidth.  Kahle  said.  Once 
installed,  the  toolbar  stans  up  with  the  browser,  but  it  can  be  selectively  turned  off.  The 
suggestions  of  where  to  go  are  based  on  Alexa's  archive  of  the  Web  that  is  conunually 
reindexed  and  re-evaluated.  Processing  the  2  terabytes  of  Web  data  is  no  big  deal  for  someone 
used  to  working  in  the  supercomputing  arena.  Kahle  said.  "The  Internet  is  tmallv  big  enough 
to  look  at." 

In  addition  to  the  surfing  assistant.  .Alexa  includes  a  real-time  chat  client  for  instant 
communication  with  other  .Alexa  users. 


.Alexa  stores  and  processes  the  data  at  its  headquarters  using  racks  of  Sun  SparcStaiions  and 
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Thursday,  July  17,  1997 

Alexa   Internet   Introduces   Web  Navigation  That  Learns  From  People 

■  Free  Toolbar  Adds  Concext.  Conunuitv  and  Efficiencv  to  Web  Searchme 


PREV  STORY  JP^^i  .\N  FR.\NCISCO--Ale.xa  Incemec  announced  today  a  new  Web  navigation 

p^^J^  service  that  learns  from  people.  The  ser^•ice.  Alexa.  is  tree  to  users  of  the 
Nl^.XT  ^TOp>'  Kfci^l  World  Wide  Web  and  provides  a  continuous  source  of  relevant 

recommendations  of  where  to  go  next  on  the  Web.  These  recommendations 
are  denved  by  analyzing  patterns  withjn  the  W'eb  and  anonymous  usage  paths.  .Alexa 
works  in  parallel  with,  and  independently  ot.  Web  browsers  and  is  displayed  as  a 
toolbar  on  the  user's  desktop.  Users  can  download  the  beta  version  of  the  service 
starting  today  at  hup'  ''.v.a".  .iicv_i.j.Mn  ".A.Iexa  learns  from  people's  use  of  the  .\et.  so 
that  future  users  can  find  the  good  stuff  they  know  is  out  there."  sajd  .Alexa  president 
Brewster  .^;ahle.  'In  1993,  the  Web  allowed  ever. one  to  become  a  publisher,  in  1997, 
.Alexa  'A ill.  in  effect,  allow  everyone  to  become  an  editor. ' 
.Alexa;  Know  Where  You  .Are  and  Where  To  Go  Next 

The  free  .Alexa  service  is  a  toolbar  that  gives  you  information  about  where  you 
are  on  the  W'eb  and  where  to  go  next.  The  client  works  with  Web  browsers  and  is 
displayed  at  the  bottom  of  your  screen.  .Alexa  offers  immediate,  tangible  benetits  for 
W'eb  users  --  sawy  and  inexpenenced  alike. 

For  example,  if  you  search  the  Web  for  mountain  bike  information  you  might 
rlrst  use  your  favonte  search  engine  or  directory.  When  you  arrive  at  a  mountain  bike 
sue.  .Alexa  lets  you  know  how  others  have  liked  this  site  --  whether  they  have  backed 
out  or  decided  to  click  through  --  and  what  paths  through  the  site  most  users  followed. 

If  you  decide  to  compare  other  brands  ci  bikes,  you  won't  have  to  return  to 
your  search  engine  or  directory  and  begin  a  new  search. 

Instead.  .Alexa  offers  continual  access  to  a  list  of  sites  frequented  by  others 
interested  m  the  topic.  .Alexas  archr-e  of  the  Web  virtually  eliminates  the  nsk  of 
finding  a  dead  Liak  when  a  page  is  no  longer  available  on  the  Web. 

.Alexa  users  have  easy  access  to  four  pnmary  features  contained  m  the  .Alexa 
toolbar: 
-  .  Where  am  P 

.Alexa  provides  you  with  live  information  on  every  site  you  visit  such  as: 

--  Who  the  site  is  registered  to  --  How  many  pages  are  on  the  sue  --  How  many 
others  point  to  this  sue  -  How  frequently  the  site  is  updated  --  The  site's  populanty 
among  our  users 

Where  should  I  go  next'!* 

.Alexa  uses  the  content  of  the  Web  and  the  insights  of  its  users  to  help  you 
decide  v-here  to  go  next.  By  analyzing  patterns  within  the  Web  and  anonymous  usage 
paths.  .Alexa  provides  you  with  a  contmuous  source  of  relevant  sues  that  search 
engines  vvould  not  suggest. 

What  happened  to  my  page^ 

.Alexa  is  the  only  service  that  provides  you  wuh  quick,  automatic  access  to  an 
archive  of  the  entire  public  content  of  the  Web.  Now.  instead  of  your  browser 
displaying  a  "-104  -- 

Not  found"  message,  .Alexa  will  retneve  a  copy  of  the  page  from  our  archive. 

W'ho  else  is  online'^ 

.Alexa  lets  you  know  which  of  your  fnends  and  colleagues  are  online  so  you  can 
communicate  with  them  in  real-tim.e. 

"Scuba  divers  use  the  buddy  system  when  navigating  underwater;  cybersurfers 
need  similar  help, '  said  Jerry  Michalski.  .Managing  Editor  of  Release  1 .0.  "Because  it 
combines  access  to  publicly  available  Internet  inl'ormation  with  useful  analysis  of 
member  activity  and  smart  use  of  Internet  resources.  .Alexa  offers  such  help.  The 
service  should  make  using  the  Internet  simpler  and  more  compellmcr  for  many 
people. ' 
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The  Collective  \Visdom  o!'  th 
[nierr.ec  search  ;ngines  porrcr 

:2rch  engines  pro^!^^  no  ooruex: 

-ejdt: 


Web  --  People 

1  searches  based  on  a  few  ke;. words.  These 
\!de  no  oof.tex:  vvhen  recuming  suggested  links  ;o  the  user.  The 
r~che  information  that  the;,  have  gathered  and  indexed  is  their  strength: 
difrlcu!;--  of  use  ar-i  over-generaijty  are  their  weaknesses. 

[n  ;:rr,panscr.  -Ale.xa's  e%er-changing  and  context-sensitive  hsts  of  suggested 
s;:es  :c  -isit  are  c.'-^t<^d  from  the  ar,al>sis  of  the  combination  of  the  overall  traffic 
::c[ter-.s  rf  the  V^-'b.  the  ecology  c:  links  between  sues,  the  content  o:  the  Web  pages 
arid  ancr,'.  mou-  'Jsage  paths.  In  essence.  Alexa  harnesses  the  collective  wisdom  of  the 
^^eb  b'.  takin.-' '^'^'•■^t^--  of  its  most  killer  app  --  the  people  'Aho  use  it. 

"Alexa  provides  conte.xt-sensKive  advertising  with  our  "Where  to  go  next' 
sui2est:on>-'  ^^^  Bruce  Gilliat.  General  Manager  of  Ale.xa  Internet.  "While  users 
■Aijfrece;''^  advertisements  when  using  the  ser.ice.  the  ads  will  be  relevant  to  what 
>cu  are  ■:S'^ing  and  in  many  cases  may  act  as  additional  site  suggestions."  The  beta 
per.od  ■•'il  be  limited  to  the  first  10.000  users  to  register  and  download  the  client.  The 
serM'.-  ■■^ill  then  be  unavailable  for  download  unul  it  is  available  for  general 
disrnbuticn  toward  the  end  of  this  summer.  .Alexa  will  irutiaJly  be  compauble  with 
broy-sers  that  run  on  a  Windows95  or  Windows  NT  operating  system. 
.Alexa  Internet  Partners 

.Alexa  is  working  with  several  strategic  partners.  Encyclopaedia  Bntannica  and 
.Ale.xa  have  agreed  to  integrate  a  suite  of  content  ser/ices  to  provide  a  comprehensive 
desktop  reference  tool.  .Alexa  and  TRUSTe.  a  global,  non-profit  initiative  to  establish 
trust  and  confidence  in  electronic  communication,  are  working  together  to  give  users 
more  inicrmation  to  make  wise  Internet  decisions. 

Additionally.  Alexa  Internet  is  working  with  .Apte.x  to  apply  the  .Apte.x 
ConvectisiTM)  software  to  the  problem  of  context-sensitive  suggestions. 
.About  .Ale.xa  Internet 

.Alexa  Internet  analyzes  multi-terabyte  collections  of  data  to  create  navigation 
ser.ices.  Founded  in  .Apnl  1996  b>  Brewster  Kahle  and  Bruce  Gilliat.  .A lexa~ Internet 
hai  harr-.essed  the  collecuve  wisdom  and  expenences  of  the  enure  Internet  community 
and  transformed  them  into  the  .Alexa  ser-'ice.  Tne  company  donates  a  copy  of  the  Web 
on  an  ongoing  basis  to  the  non-profit  Internet  .Archive,  which  is  endowed  to  preser-e 
our  digital  hentage  for  scholarly  access.  .Alexa  Internet  is  located  at  The  FYesidio  of 
San  Francisco.  PO  Box  29141.  San  Francisco.  California  94129-0141.  For  business 
nformation  call  415/561-6900,  fa.x  415/561-6795.  The  information  email  address  is 
nr'-".-  .\ie'-"i  ^-.'m.  Contacts:  Jennifer  Weissman.  .Antenna  Group.  415/977-191 1, 
e:inifer---.'.inepnarr  c:"m.  or  Cynthia  Lohr.  .Alexa  Internet.  415/56I-67S6, 


-••nrnig^-^  j.iex.-i.jom. 

Note  to  Editors:  .Alexa  is  a  trademark  of  .Alexa  Internet.  .All  other  trademarks  are 
the  procenv  of  their  resoective  owners. 

CONT.ACT:  Ale.xa  Internet  Cynthia  Lohr.  415/561-67S6 

^••■-•nia  "  "ilexa  .■'■m  or  .Antenna  Group  Jennifer  Weissman.  415/977-191 1 

^en-i:  -'e^'.^  inter: -larr  .-:Mn 
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INT)L'STRY  KE^-WORD:  COMPLTERS-ELECTRONICS  CO.MED 
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of  the  Los  .Angeles  Times  for  similar  stones.  You  will  not  be  charged  to  look 


for  stones,  only  to  retneve  one. 
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SEARCHING   FOR   INFORMATION 

IKs  A 
Jungle 

Out  There 
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BY  JEFF  UBOIS 

Monday  morning — your  first  day  back  at  work 
after  a  vacation  in  the  ti'opics.  The  trip  was  fal)- 
ulous.  Ijut  ^hat  is  that  nast\-  rash  under  your 
arm?  Trying  to  avoid  a  trip  to  the  doctor  you 
hop  on  the  Wei).  Giving  AhaMsta  a  whirl  you 
r\-]:)e  in  "jungle  rot. "  The  senice  goes  on  safari  in 
the  digital  jungle,  and  then  ...  argh.  information 
o\erload.  Alta  \ ista  has  found  39.7-tl  references 
to  jungle  rot.  W  hile  vou  scratch  uncontrollaljlv. 
you  lament,  "(ian't  anyotie  come  up  \\ith  a  better 
\\av  to  sort  and  retrieve  information?" 


ILLUSTRATION  BY  KIM  WILSON  BRANDT 
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Brewster  Kohle,  president  of  Alexo  Internet,  San  Francisco 


Fifty  years  ago,  presidential  science 
adviser  Vannevar  Bush  accurately 
predicted  the  rise  of  personal 
computers,  information  storage 
iind  seai'ch  tools,  and  electronic 
commerce  m  the  article  "As  We  May 
Thmk,"  published  in  the  July  1945  issue  of 
The  Atlantic  Monthly.  It  has  become  one 
of  the  most  wdely  cited  scientific  articles 
of  the  century. 

Bush,  who  oversaw  all  U.S.  wartmie  sci- 
ence efforts  (rncludrng  the  Manhattan  Pro- 
ject \voiTied  that  mcreasing  specialization 
would  soon  stall  scientific  progress.  Notmg 
that  'Menders  concept  of  the  laws  of  gen- 
etics was  lost  to  the  world  for  a  generation 
because  his  publication  did  not  reach  the 
few  %\-ho  were  capable  of  grasping  and  ex- 
tendmg  it,"  Bush  proposed  a  solution  called 
the  Memex.  He  envisioned  "a  device  in 
which  an  individual  stores  all  his  books, 
records  and  communications,  and  wliich  is 
mechanized  so  that  it  may  be  consulted 
with  exceeding  speed  iind  flexibihty." 

It  was  a  gieat  vision,  but  the  promise  of 
the  Memex  remains  unfulfilled.  While 
Apple  Computer  Inc.  triumphantly  an- 
nounced it   had   completed  the  Meiricx 
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when  it  shipped  HyperCard  more  than  a 
decade  ago,  the  modern  approximation  of 
the  device  is  the  sum  of  Internet  search 
tools  and  navigation  services. 


Punch  in  '^travel^' 
on  Digital  Equipment 
Corp.^s  AltaVista 
search  engine,  and 
you'll  get  more  than 
7  million  possible 
answers.  That's  noise, 
not  information. 

Compames  offering  these  sen.-ices  have 
created  billions  of  dollars  in  new  market 
capitalization  during  the  past  two  years 


and  are  the  core  busmess  of  close  to  a  frfth 
of  the  public  Internet  companies.  But 
these  companies  and  market  observers 
agree  that  information  searching  is  still 
harder  and  less  comprehensive  than  it 
should  be.  So  Internet  startups  and  more 
estabhshed  companies  are  investing  mil- 
lions of  dollars  in  search  research.  These 
companies  are  pursuing  a  variety  of  strate- 
gies, including  systems  that  provide  visual 
representations  of  data,  collaborative  fil- 
ters that  gather  recommendations  from 
users,  enteiprise  software  enhanced  with 
search  capabiUties  and  advanced  Web-search 
engmes. 

A  Difficult  Problem 

Search  services  have  unified  what  was  once 
a  sprawl  of  uncharted  hefdoms  on  the  In- 
temet.  But  despite  advances  m  processing 
and  storage,  the  underlving  approach  has 
remained  essentially  unchanged  for  years: 
Users  enter  a  few  descnptive  words  and 
hope  those  words  are  used  in  relevant  doc- 
uments. It's  fme  for  searchmg  through  a 
collection  of  a  few  thousand  items,  but 
punch  in  "travel"  on  Digital  Equipment 
Coip.'s  AltaVista  search  engine,  and  you'll 
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get  more  than  "^  million  possible  answers. 
That's  noise,  not  infonnation. 

The  explosion  of  data  available  on  the 
Web  is  aggi-avating  tlie  problem.  By  late 
1997,  there  were  more  than  640,000  sites 
and  100  million  pages  on  the  Web,  and  the 
number  continues  to  double  every  six 
months.  Without  fundamentally  new  ap- 
proaches, the  number  of  documents  re- 
turned by  a  genenc  search  engine  will  grow 
at  the  same  rate. 

It's  already  causing  changes  m  the  way 
browsers  operate.  "We're  looking  at  how  to 
tightly  couple  the  two  main  methods  for 
information  retrieval — search  and  brows- 
ing," says  Susan  Dumais,  a  guru  in  the 
field  who  was  recently  hrred  away  from 
Bellcore  to  become  a  senior  researcher  at 
Microsoft  Corp.  But  the  real  issue  is  deep- 
er. The  rapidly  widening  gap  between  the 
amount   of  data   m   the   world  and   the 


It's  not  hard  to  imagine 
tlie  commercial  potential  of 
the  ultimate  search  tool  or 
a  magic  relevance  detector. 


Jerry  Yang,  co-lounder  of  Yahoo  Inc.,  Santa  Clara,  Calif. 


amount  ol  attention  available  to  process  it 
means  a  .giowing  percentage  will  never  be 
looked  at  by  a  human.  So  the  importance 
o!  search  and  filtering  technologies  will 
increase,  even  as  new  media  types  expand 
the  challenge  beyond  the  realm  of 
straight  text. 

Searching  for  a  Solution 

"People  have  spent  many  millions  of  dollars 
trvTng  to  get  traditional  search  technology'  to 
work,"  says  Brewster  Kahle,  president  of 
San  Francisco-based  Alexa  Internet.  "But  as 
the  amount  of  infomiarion  and  the  number 
of  people  using  it  goes  up,  these  traditional 
approaches  are  failing,  and  we  need  a  new 
genre  of  answer." 

Several  dozen  companies  believe  they've 
found  it.  Though  they  take  an  amazing 
variety  of  forms,  business  models  and 
technical  approaches,  they  all  promise  to 
detemime  the  relevance  of  information  for 
their  users.  Generally,  they  fall  into  three 
camps:  free,  ad-supported  gateways  to  the 
Web;  technical  innovators;  and  enterprise- 
software  companies. 

A  few  of  them  will  wm 
big,  as  Yahoo  Inc.  already 
has.  But  most  are  fated  to 
be  acquired  by  big  compa- 
nies if  things  go  well,  or 
to  close  their  doors.  Search, 
filtering,  personalization 
and  agents — or  whatever 
technologies  eventually 
dominate — are  too  impor- 
tant to  remain  in  the  hands 
of  small  companies.  Big 
companies  need  to  own  the 
best  version  of  search,  and 
they  can  afford  to  buy  it. 

It's  not  hard  to  miagme 
the  commercial  potential  of 
the  ultimate  search  tool  or 
a  magic  relevance  detector. 
Terabytes  of  unstructured 
mformanon  could  be  ordered 
and  made  accessible,  dupli- 
cation of  effon  would  be 
reduced  or  eliminated,  and 
compames  could  begm  to  get 
a  return  on  unused  informa- 
tion assets.  And  if  the  magic 
relevance  detector  were  a 
channel  of  some  kind,  tuned 
perfectl>-  to  the  user,  why 
would  anyone  ever  want  to 
switch  It  off; 


Portals  and  Gateways 

Start  with  thai  last  question  hrst  I'roviding 
a  portal  or  gateway  to  other  sites  rather 
than  generating  original  content  made 
Yahoo  and  AltaVista  mimediate  successes 
and  IS  the  core  business  of  Infoseek  Corp.  of 
Sunnyvale,  Calii.;  Excite  Inc.  of  Redwood 
City,  Calif.;  Lycos  Inc.  of  Framingham, 
Mass.;  Inktomi  Corp.  of  San  Mateo,  Calif.; 
and  others.  But  anyone  can  be  a  gateway, 
and  several  of  the  early  Web  mdices  have 
either  been  shut  down  or  acquired. 

To  create  a  sustainable  model,  Web 
directones  and  search  engines  are  evolvmg 
mto  media  brands  that  create  destinations 
rather  than  ser\'e  as  gateways.  Then  new 
competition  is  not  just  America  Onlme 
Inc.,  but  television  and  newspapers. 

"People  have  to  realize  the  Web  has 
grown  so  much  since  the  search  days,  and 
there  is  a  need  to  expand  beyond  that  one 
function,"  says  Jerry  Yang,  co-founder  of 
Yahoo.  "What  we  do  is  a  media  game,  not  a 
software  game,  and  search  is  only  a  com- 
plement to  the  overall  service,  not  the 
dominant  element." 

George  Bell,  president  and  CEO  of 
Excite,  echoes  these  sentiments.  "Techno- 
logy enables  everything  we  do,  so  it  is  enor- 
mously important,  but  this  busmess  is 
about  building  and  growing  brands,"  BeD 
says.  "I'm  less  curious  about  the  search 
market  and  more  curious  about  how  to  use 
the  search  technolog)'  to  bring  a  more  use- 
ful and  convement  experience."  Innovators 
m  the  Web-wide  search  busmess  are  trying 
to  improve  results  by  pro\'iding  bener  cate- 
gorization of  search  results,  automatic  sug- 
gestions of  new  terms,  improved  "refine" 
features  that  make  it  easy  to  tr\-  again  and 
the  use  of  personalized  pages  that  are  essen- 
tially saved  searches. 

Elite  Content 

Rather  than  make  searches  more  exhaus- 
tive, some  beheve  the  quickest  way  to 
improve  the  quality  of  search  results  is 
to  index  only  top-quality  content.  "It's 
strange  that  a  lot  of  the  differentiation  in 
the  search  engines  is  'I  mdex  the  most  doc- 
uments,'" says  Steve  Krause,  president  o! 
Aifmicast  Corp.  of  San  Francisco,  which 
makes  server  software  that  matches  reader 
mterest  with  ads. 

One  company  working  wixh  premium 

content  is  Northem  Light  Technolog>-  LLC 

of  Cambridge,  Mass.,  which  has  combined 

continued  on  page  146 
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a  Web-wide  search  cnguic  with  automatic 
catcgdnzation.  Traffic  on  the  service  (www. 
nlsearch.com)  is  growing  at  20  percent  per 
week,  with  80,000  to  90,000  searches  per 
day  as  of  late  December  1997.  "We  are  a 
search  engine,  but  we  have  this  unique  clas- 
sification scheme  we  apply  to  results,  and 
we  offer  access  to  premium  material,"  says 
marketing  manager  Leslie  Ray.  "A  lot  of 
high-quality  information  from  the  publish- 
mg  world  on  services  such  as  [Reed  Elsevier 
Inc.'s]  Lexis-Nexis  or  [Dialog  Corp.'s|  Dialog 
has  been  inaccessible  to  the  general  public." 

Excite  is  doing  something  similar,  index- 
ing 300  to  400  content  sites  three  to  four 
times  a  day.  Bell  says  it  means  "80  percent 
of  the  value  you  get  from  Lexis-Nexis  you 
get  from  us  for  free." 

While  Umiting  the  collection  of  indexed 
documents  may  provide  an  interim  solu- 
tion, others  are  pursuing  fundamentally 
new  technologies. 

Visualizing  New  Categories 

Three  classic  science  fiction  stories  about 
the  Net — True  Names  and  Other  Dangers 
( 198 1 )  by  Vemor  Vinge,  Neuromancer  { 1984) 
by  William  Gibson  (which  coined  the  word 
"cyberspace")  and  Snow  Crash  by  Neal 
Stephenson  (1993) — all  describe  user  inter- 
faces based  on  3D  representations  of  large 
data  networks. 

No  one  has  yet  built  anythmg  that 
matches  the  richness  of  the  environments 
described  in  those  hterary  works,  but  com- 
panies such  as  Lucent  Technologies  Inc.'s 
Visual  Insights;  ThemeMedia  Inc.  of  Red- 
mond, Wash.;  Perspecta  Inc.  of  San  Fran- 
cisco; Semio  Corp.  of  San  Mateo,  Calif.;  and 
Xerox  Corp.'s  Inxight  are  trying  to  tap  into 
native  human  capacities  to  deal  with  3D 
space.  Instead  of  answering  queries  with 
hsts  of  documents,  these  compames  are 
building  multidimensional,  immersive 
environments  that  provide  a  more  intuitive 
view  of  large  collections  of  data  grouped 
or  clustered  by  meaning. 

What's  immediately  striking  about  these 
offerings  is  their  visual  presentation.  A  3D 
data  representation  can  look  Hke  anything 
from  a  topograpliic  map  to  a  bike  wheel  with 
100  sp<ikcs.  But  behind  the  pretty  pictures, 


they  all  rely  on  proprietary  mediods  of  cat- 
egorizing documents  iind  search  results.  Just 
as  categorization  makes  the  difference 
between  a  well-ordered  library  and  a  ran- 
dom heap  of  Ixx^ks,  the  value  of  the  pictures 
these  companies  provide  depends  on  their 
ability  to  group  search  results  into  mciin- 
in.gful  categories. 

A  newcomer  in  this  area  is  Visual  In- 
sights, an  independent  unit  of  Lucent,  which 
unveiled  a  new  set  of  products  in  December 
for  visuiiUzing  semistmctured  data  as  well 
as  information  generated  by  automated  sys- 
tems. "There  has  been  an  explosion  in  the 
ability  to  collect  data,  but  not  to  analyze  it  or 
take  action,"  says  James  Weichel,  president 
of  the  company.  "A  more  challenging  prob- 
lem is  visualizing  data  without  a  physical 
or  geographic  metaphor." 

Weichel  offers  a  striking  example.  Ima- 
gine trying  to  numerically  describe  a  chil- 
dren's game  in  which  players  alternately 
choose  numbers  from  one  to  nine  in  an 
attempt  to  get  three  of  them  that  add  up  to 
15.  Each  number  may  only  be  chosen  once 
during  the  game,  so  player  A  might  start 
by  choosing  seven  and  player  B  by  choos- 
ing eight;  player  A  then  chooses  six,  and 
then  player  B  would  have  to  choose  two; 
player  A  would  respond  with  5  to  prevent 
B  from  winning. 

The  variations  are  numerous  and  hard  to 
solve — until  you  realize  this  scenario  corre- 
sponds exactly  to  tic-tac-toe. 

Weichel  says  similar  leaps  from,  data  to 


A  3D  data  repre- 
sentation can  look 
like  anything  from 
a  topographic  map 
to  a  bike  wheel 
with  100  spokes. 

pictures  are  equally  useful  in  a/eas  such  a^ 
network  management.  Year  2000  problerr 
analysis,  data  mining,  caU-center  operarior 
and  fraud  detection.  Much  of  the  software 
and  intellectual  property  now  owned  b\ 
Visual  Insights  (the  company  has  nini. 
patents)  has  been  under  development  foi 
several  years,  and  the  company,  whicl 
employs  about  40  people  (more  than  an- 
of  the  other  visualization  companies!,  i 
likely  to  eventually  leave  the  shelter  o 
Lucent  through  an  IPO. 

Like  others  in  the  search  industry  (sucl 
as  Verity  Inc.  of  Sunnyvale,  Calif.,  and  Ap 
tex  Software  Inc.  of  San  Diego),  Theme 
Media  has  its  roots  in  the  U.S.  intelligenc 
community.  Founded  by  researchers  fror 
the  PaciHc  Northwest  National  Laborator 
ThemeMedia  is  led  by  supercomputer  gur- 


How  big  is  your  company's  year  2000  problem?  In  the  Visual  Insights  screen  grab  above,  each  wheel  represents 
a  software  system,  such  as  payroll.  Spokes  represent  individual  programs — red  spokes  indicate  programs  with  Ye 
2000  bugs.  By  clicking  on  the  end  of  a  spoke,  a  programmer  can  view  and  repair  the  actual  code. 
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Gary  Smaby,  who  serves  as  CEO,  and  tor- 
mer  Cray  Research  CEO  John  Rollwagen, 
who  IS  chairman  ot  the  board. 

"The  best  cognitive  processor  is  between 
your  ears,  and  a  visual  metaphor  means 
that  trends  are  patterns  that  can  be  seen," 
says  Steve  Ardire,  ThemeMedia's  senior 
director  for  business  development.  Though 
it  isn't  shipping  any  products  yet,  Theme- 
Media  plans  to  make  money  selling  server 
software  and  providmg  free  chent  software 
for  viewing.  "We  have  a  core  offering  that 
could  be  a  front  end  for  an  Infoseek  or  a  col- 
laborative filtering  company  such  as  Net 
Perceptions  or  Autonomy,"  Ardire  says. 

The  hope  is  to  go  beyond  the  limits  of 
Boolean  searching,  which  is  based  on  partic- 
ular terms  and  logical  operators  such  as 
AND,  OR  and  NOT.  "The  problem  widi 
Boolean  is  that  users  have  to  know  specific 
facts  about  documents  before  they  conduct  a 
search,"  Ardire  says.  "[Our  technology]  lets 
you  drill  down  deeper  to  reveal  new  data, 
and  none  of  the  Boolean  offenngs  allow  you 
to  do  that." 

Another  company  using  visualization  is 
Perspecta.  "If  you  know  exactly  how  to  de- 
fine a  query,  a  search  engine  is  usefid,"  says 
President  and  CEO  Steve  Holtzman.  "We 


arc  aiming  at  the  other  90  percent  who 
can't  define  exactly  what  they  want,  and  we 
allow  them  to  dynamically  reorganize  the 
data  depending  on  how  they  want  to  see  it" 
(for  example,  reordering  a  news  database 
according  to  date,  topic  or  source). 

Founded  in  January  1996  by  MIT  Media 
Lab  alumni,  Perspecta  has  received  funding 
from  Encyclopaedia  Britannica  and  Infor- 
mix Corp.  and  counts  these  companies,  as 
well  as  The  Sabre  Group,  as  customers. 

Visuahzation-software  companies  share 
the  abiUty  to  present  an  immediate  and  in- 
tuitive connection  between  different  clus- 
ters of  data  and  to  relate  those  to  a  user's 
position  in  a  3D  environment.  But  it's  not  the 
only  new  approach. 


George  Bell,  president  and  CEO  of  Excite  Inc., 
Redwood  Oty,  Calif. 


James  Weichel,  president  of  Visual  Insights,  a  unit  of  Lucent  Technologies. 
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Collaborative  Filters 

How  do  you  know  what  movie  to  see,  what 
restaurant  to  eat  at,  or  what  book  to  read;  If 
you're  like  most  people,  you  rely  on  the  rec- 
ommendations of  peers.  The  Net  offers  a 
unique  way  for  individuals  to  pool  their  col- 
lective wisdom  through  collaborative  filters, 
or  recommendation  systems.  A  collabora- 
tive filter  can  allow  a  Web  site  to  offer 
restaurant  ratings,  for  example.  When  peo 
pie  visit  the  site,  they  are  asked  to  rate  a  few 
restaurants,  and  the  system  then  finds 
others  with  similar  preferences  and  makes 
suggestions  about  places  to  eat. 

Collaborative  filters  vary  by  the  type  of 
subject  matter  they  deal  with,  whether 
they  match  people  to  people  or  people  to 
things,  and  whether  they  collect  data 
actively  and  expUcitly,  passively  and  im- 
plicitly, or  in  combination.  But  they  all 
provide  answers  to  questions  and  predict 
what  a  given  person  will  like  by  condens- 
ing opinions  from  thousands  of  users. 
Unlike  search  engines,  collaborative  filters 
learn  about  the  preferences  of  individual 
users,  and  they  can  offer  recommendations 
about  things  that  can't  be  indexed,  such  as 
video,  for  example. 

There  are  a  several  companies  and  prod- 
ucts in  this  space,  including  Net  Percep- 
tions Inc.'s  GroupLens  Recommendation 
Engine,  which  helps  companies  match 
individual  customers'  tastes  and  is  backed 
by  The  Paul  Allen  Group  and  Hummer 
Wmblad  Venture  Partners,-  Autonomy  Inc.'s 
Agentware  i3,  which  helps  Web  publishers 
deliver  targeted  content;  LikeMinds  Inc.  of 
San  Francisco,  which  focuses  on  one-to-one 
marketmg  and  has  a  movie  recommenda- 
tion demo;  Gustos  Software  LLC  of  Laguna 
Hills,  Calif.,  which  lets  users  rate  Web  sites; 
and  Pittsburgh-based  Wise  Wire  Corp.'s 
Wise  Wire  information  service. 

But  the  most  prominent  is  Firefly  Net- 
work Inc.,  a  Cambridge,  Mass.-based  com- 
pany founded  in  1995  by  another  group 
of  MIT  Media  Lab  researchers.  It  was  first 
out  of  the  gate  with  a  music  recommen- 
dation system  (later  sold  to  a  startup  called 
Launch).  "We  are  moving  toward  a  model 
of  computing  and  information  retrieval  diat 
is  based  around  the  user  instead  of  an  oper- 
atmg  system  or  datacentric  model,"  says 
Saul  Klem,  Firefly's  senior  VP  of  corporate 
brand  and  strategy. 

Firefly  has  fallen  short  of  some  of  its  early, 
perhaps  ovennflated,  hopes — the  company 
was  supposed  to  be  public  by  now  but  isn't. 
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In  addition,  die  math  behind  collaborative 
filtering  is  easy  to  duplicate,  which  has 
split  the  market  among  many  vendors, 
and  the  killer  app  (or  the  killer  data  set)  for 
collaborative  filtering  has  been  elusive. 

"I  don't  want  to  knock 
collaborative  filtering — we 
use  it, "  says  Yang  of  Yahoo, 
which  is  using  Firefly  in  its 
My  Yahoo  service.  "Firefly 
has  done  a  good  job  with 
movies  and  music,  but  our 
focus  now  is  to  look  at  such 
technology  to  improve  our 
content  targeting  and  adver- 
tising targeting."  Striking  a 
balance  between  active/ex- 
plicit and  passive/imphcit 
search  and  navigation  hasn't 
been  done.  It's  an  area  ripe 
for  new  solutions,  and  one 
of  the  more  intriguing  is 
Alexa  Internet. 

The  Alexa  service  is  de- 
livered via  a  client  that  looks 
like  an  additional  toolbar 
for  the  Netscape  or  Internet 
Explorer  browsers.  Alexa  dis- 
plays information  about  what- 
ever site  a  user  is  accessing, 
lets  tisers  vote  on  site  quality  and  offers  sug- 
gestions about  where  to  go  next. 

Alexa's  information  is  generated  by  end 
users,  from  third-party  information  pro- 
viders (including  Encyclopaedia  Britannica) 
and  from  an  analysis  of  the  company's  copy 
of  the  entire  Web.  By  gathering  copies  of  all 
pubUcly  accessible  sites  (a  collection  that 
is  now  8  terabytes  and  growing),  Alexa  can 
analyze  the  link  structure  of  the  Web  to  de- 
termine which  pages  are  relevant  to  each 
other,  and  allow  users  to  request  archived 
Web  pages  when  a  "404 — File  Not  Found" 
error  occurs.  Data  gathered  by  Alexa  is 
donated  to  the  nonprofit  Internet  Archive 
(www.archive.org)  for  long-term  safekeeping. 

"If  you  take  a  path  through  the  woods, 
you  beneht  from  the  exploration  people 
have  done  before  you  in  finding  the  best 
way  up  the  mountain  or  down  to  the  lake," 
says  Alexa's  Kahle.  "We  do  the  same  thing 
for  the  Web.  When  your  browser  hits  a  URL, 
the  toolbar  requests  information  about  it 


from  Alexa's  servers,  and  we  record  that  an 
Alexa  user  has  spent  time  at  that  site  as  a 
kind  of  vote." 

What  Alexa  has  m  common  with  the 
collaborative  filtering  ;ind  recommendation 
systems  is  an  approach  based  on  metadata, 
a  loosely  defined  term  that  is  rapidly  be- 
coming a  buzzword  used  to  descnbe  any 
data  about  other  data.  A  call  number  in 
a  Ubrary,  a  keyword  descnbing  an  article 


Brett  Newbold,  president.  Open  Text  Corp.,  Waterloo,  Ontario. 


"H  you  take  a  path 
through  the  woods, 
you  benefit  from  the 
exploration  people 
have  done  before 
you.  ...  We  do  the 
same  thing  for  the 
Web/'  says  Alexa's 
Brewster  Kahle. 


and  author  tags  in  HTML  arc  all  examples 
of  metadata. 

Metadata  is  becoming  increasingly  im- 
portiint  because  it  enables  searches  through 
content  that  can't  be  indexed  easily,  as  well 
as  through  enormous  collections  of  d;itii. 
Metadata  has  become  a  major  focus  for  the 
World  Wide  Web  Consortium,  which  is  de- 
veloping a  new  standard  called  XML.  This  is 
a  specification  for  defining  new  markup  lan- 
guages to  meet  the  future 
needs  of  Web  documents 
and  Web  searching. 

Enterprise 
Software 

Companies  such  as  Venty; 
Fulcrum  Technologies  Inc  of 
Ottawa,  Ontario;  Compass- 
Ware  Development  Inc.  of 
New  York;  Quadralay  Corp. 
of  Austin,  Texas;  Personal 
Library  Software  (PLS)  Inc. 
of  Rockville,  Md;  Open  Text 
Corp.  of  Waterloo,  Ontario; 
Open  Market  Inc.'s  FoUo 
unit;  and  Excalibur  Techno- 
logies Corp.  of  Vienna,  Va, 
are  pursuing  opportunities 
created  by  corporate  intra- 
nets. They're  competing  with 
offerings  from  the  public 
search  engine  companies,  in- 
cluding AltaVista,  Infoseek, 
Lycos  and  Excite,  all  of  which 
are  Licensing  the  software  they  use  to  support 
their  public  sites. 

Like  the  "elite  content"  companies, 
these  firms  know  that  the  value  of  the  data 
being  searched  determines  the  value  of 
search  applications.  And  because  data 
inside  companies  is  extremely  valuable,  the 
tools  that  access  this  data  are  valuable,  too. 
One  of  the  most  established  companies  in 
this  space  is  Verity,  which  emphasizes  the 
work  it  has  done  to  build  interfaces  to  other 
vendors'  database  products  and  is  hoping  to 
distance  itself  from  low-end  search  prod- 
ucts. With  dozens  of  free  or  low-cost  search 
tools  available.  Verity  wants  to  stand  out 
from  the  crowd  by  building  connections  into 
Lotus  Notes  and  other  large  databases,  not 
by  adding  a  more  sophisticated  user  inter- 
face. "Search  and  retrieval  is  fragmenting 
into  two  or  more  primary  camps:  commod- 
ity personal  search  and  tools  for  corporate 
use,"  says  Ronald  Weissman,  Verity's  VP 
of  marketing. 
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Coinnioditi2<ition  is  an  issue,  but  the 
most  senous  threat  to  the  small  and  mid- 
size search  companies  is  competition  from 
IBM  Corp.,  Oracle  Corp.  and  Microsoft,  all 
of  which  are  adding  functions  to  existing 
products  and  developing  new  ones. 

The  stock  prices  tell  the  story.  By  the  end 
of  1997,  Fulcrum  had  dropped  from  a  1996 
high  of  $20.50  to  less  dian  $2;  Verity  had 
plunged  from  more  than  $50  in  spring  1996 
to  about  $5;  and  Excalibur  had  recovered  to 
about  half  the  value  it  had  in  early  1997. 

"Verity,  PLS,  Fulcrum  and  similar  com- 
panies are  having  a  hard  time  because 
search  by  itself  doesn't  solve  anyone's  real 
problem.  There  is  much  more  to  do,"  says 
Open  Text  president  Brett  Newbold.  "How 
do  you  compete  with  Oracle  when  Oracle 
says  it  is  free,  or  Microsoft  makes  it  a  free 
part  of  Exchange?"  Smaller  companies  are 
ninning  for  cover  by  creating  focused  appli- 
cations. "Every  [company  that]  was  any- 
where near  this  space  is  now  calling  itself  a 
'knowledge  management'  company," 
Newbold  says.  Knowledge  management  is 
evidently  on  its  way  to  becommg  the  total 
quality  management  or  re-engineering  of 
the  late  1990s — even  Open  Text  is  describ- 
mg  itself  as  a  "collaborative  knowledge 
management  company." 

Customer  service  is  another  good  candi- 
date application  in  this  area.  Paul  McGraw, 
executive  VP  and  co-owner  of  APS  Tech- 
nologies, a  Kansas  City,  Mo.,  supplier  of 
SCSI  and  other  penpheral  products,  says  his 
company  would  like  to  find  a  way  to  cate- 
gorize and  access  the  knowledge  contained 
in  the  problem -resolution  notes  taken  by 
technical  support  representatives.  APS  has 
used  an  extensive  coding  scheme  to  key 
into  typical  problems,  but  tapping  the 
megabytes  of  text  notes  is  a  daunting  task. 

"A  simple  search  may  not  find  parallel 
port  cables  with  SCSI,  or  a  machine  that 
freezes  may  not  be  diagnosed  correctly  as 
having  a  SCSI  problem,"  McGraw  says. 
APS  has  worked  with  software  from  FoUo 
to  provide  some  context-sensitive  help,  but 
its  greatest  successes  in  diagnosing  prob- 
lems have  come  from  manually  analyzing 
the  data,  or  from  running  SQL  queries 
against  an  AS/400  database. 

Clearly,  there  is  room  for  improvement 


and  a  market  for  enterprise  tools.  But  ad- 
vances in  this  iirea  will  depend  on  the  evo- 
lution of  new  sciirch  technology. 

Future  Search 

Yahoo  has  shown  that  ads  can  support  nav- 
igation services  and  that  brandmg  can  be  as 
important  as  technology.  If  there  are  no 
great  leaps  forward  in  search  technology, 
refining  the  art  of  human  assistance  will 
be  the  next-best  thing,  and  the  art  of  build- 
ing search  services  will  depend  on  know- 
ing where  to  add  human  help,  as  Yahoo 
has  done. 

"We  don't  believe  there  is  this  magic 
technology  to  make  things  much  better  or 
worse,"  Yang  says.  "Anything  we  adopt  has 
to  address  80  percent  of  the  users.  If  it's 
usefid  for  only  a  few,  it's  not  our  business 
and  we  can't  support  it." 

For  any  technology  player  that  hopes  to 
win,  Yang  adds,  an  offering  "not  only  has  to 
scale,  but  also  must  be  compelling  enough 
that  lots  of  people  want  to  use  it,  and  most 
have  failed  at  either  or  both."  Still,  Digital 
proved  the  importance  of  technical  bnite 


Commoditization  is 
an  issue,  but  tiie  most 
serious  threat  to  tiie 
small  and  midsize 
search  companies  is 
competition  from 
IBM  Corp.,  Oracle 
Corp.  and  Microsoft. 

force  when  its  AltaVista  service  increased 
the  value  of  the  Web  by  an  order  of  magni- 
tude overnight.  New  ways  to  cluster  and 
view  search  results  and  to  iterate  searches 
could  do  the  same.  Standards  to  allow 
continued  on  page  155 


Digging  for  Data:  Where  and  How  to  Look 


Some  of  the  frustration  with  search 
engines  can  be  traced  to  the  fact  that 
most  users  don't  have  a  clue  how  to 
use  the  technology  effectively.  For 
starters,  entering  just  one  word  into  a 
search  engine  is  like  using  a  backhoe  to 
pick  out  a  splinter.  Search  engine  oper- 
ators say  that  among  those  rare  users 
who  enter  more  than  a  single  term,  few 
think  carefully  about  synonyms  or  are 
adept  at  making  queries  using  Boolean 
logic— for  example,  "A  AND  (B  OR 
C)."  Among  those  who  do  understand 
Boolean  logic,  many  don't  take  the 
time  to  read  the  directions  for  each 
search  engine's  command  syntax. 

But  because  search  technologies  are 
still  stupid,  users  have  to  be  smarter 
about  making  them  work  and  about 
zeroing  in  on  the  good  stuff  by  using 
several  engines.  One  option  is  to  use 
metasearch  services — such  as  www. 
mamma.com,  www.metasearch.com  and 
www.search.com — which  combine  re- 
sults from  more  than  one  database. 
Search  experts  also  suggest  that  those 
digging  for  data  focus  their  search  re- 


quests, find  unique  identifiers  associated 
with  the  topic  (proper  names,  geogra- 
phy, dates  and  jargon),  play  with  syno- 
nyms and  keep  tiying. 

The  best  searches  work  by  iteration, 
and  sometimes  by  coming  at  the  ques- 
tion from  a  different  angle  or  by  using 
a  different  set  of  data  (Usenet  archives 
rather  than  the  Web,  for  example). 
Maybe  the  right  answer  isn't  indexed 
on  the  Web  but  can  be  found  within  a 
vendor's  Web  site.  The  trick  is  to 
know  how  and  when  to  broaden  a 
search  and  when  to  narrow  it. 

You  may  also  want  to  consider  using 
a  fee-based  service.  Offerings  include 
Northern  Light  Technology's  service 
(www.nlsearch.com)  and  Infonautics' 
Electric  Library  (www.elibrary.com).  Or 
if  you  want  to  pay  through  the  nose, 
try  Dialog  Corp.'s  Dialog  service  (www. 
dialog.com)  or  Reed  Elsevier  Inc.'s  Lexis- 
Nexis  (wv/w.lexis-nexis.com).  These  last 
two  services  charge  for  searches  based 
on  the  Ubraries  accessed,  the  connect 
time  required  and  the  number  of  docu- 
ments printed.  — /.  U. 
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Search  Sectors 

Notable  players  in  four  categories  of  software 

The  only  way  to  categorize  the  companies  developing  search  technology  is  to  be  arbitrary  about 
it.  The  boundaries  between  different  segments  are  blurry  (Excite,  Infoseek  and  other  Web  indices 
sell  their  software  for  use  on  intranets),  and  every  successful  new  company  will  end  up  competing 
against — or  possibly  being  acquired  by — the  Hkes  of  Microsoft,  IBM,  Oracle  and  Netscape. 

— Michael  Mattis  and  Jeff  Uhois 


Traditional,  Ad-supported  Search  Engines  and  Directories 


"In  terms  of  aggregate  traffic,  od  revenue  and  brand 
recognition,  Yahoo  is  clearly  the  leader  in  the  search  and 
directory  spoce.  For  a  long  time  it  hos  been  a  two4iorse 
race  with  Yahoo  still  well  ahead  of  Excite.  This  spoce  is 
no  longer  about  search,  as  the  technology  of  search  has 
essentially  become  a  commodity  (witness  the  fact  Ifiat 
Yahoo  even  licenses  search  technology  from  AltaVista). 
Instead  of  trying  to  implicrtly  compete  with  one  onottier, 
search  ond  directory  companies  ore  starting  to  oggres- 
sively  pursue  the  same  oudience  as  AOL — they  are 
competing  witti  online  services  (as  news  and  general 
information  sites).  AOl  has  even  sold  its  equity  stoke  in 
Excite,  OS  they  now  see  the  compony  os  a  competitor." 
— Ponick  Keane,  kpitet  Communicalions  LLP 


Yahoo  Inc. 

Lofation:  Santo  Clara,  Calif. 
URL  www.yahoo.com 
Ownership:  Public  (Nasdaq:YHOO) 
Mom  ProAnt:  Yohoo  (Yet  Another 
Hierarchical  Officious  Oracle)  software 
locates,  identifies  and  sorts  moterid  stored 
on  the  Internet. 

Excite  Inc. 

UKotim:  Redwood  Gty,  Colrf. 
URL  www.excite.com 
Ownership:  Public  (Nosdoq:  XCIT). 
(Intuit  Inc.  recently  invested  $40 
million  in  Excite.) 

Main  ProdiKt:  Excite  ICE  (Intelligent 
Concept  Extraction)  seorcfies  the  Web  and 
Usenet  for  documents  containing  keywords 
and  related  corKepts. 


Infoseek  Corp. 

Loiotiofl:  Sunnyvale,  Calif. 
URL  www.infoseek.com 
Ownership:  Public  (Nosdoq:  SEEK) 
Main  Product:  Infoseek  Seorch  searches 
the  Web,  specific  sites,  URb,  titles  or  links 
without  Booleun  operators. 

ly<os  inc. 

LocatiMcr 
URL  www.lycos.com 
Owner^:  Public  (NosdoqiCOS) 
Moin  Prodott:  lycos  Search  locates, 
retrieves  and  monoges  information  stored 
on  the  Net. 


Inktomi  Corp. 

Location:  Son  Moteo,  CoM. 
URL  www.inktomi.com 
Ownership:  Private 
Mam  Product:  Inktomi  Seorch 
Tedinobgy  is  a  scalable  engine  for  ■torch- 
ing and  retrieving  infomration  stored  on  the 
Internet.  It  poweR  such  services  as  Wireds 
HotBot. 

AltaVista  Search 
Network  Inc. 

location:  Polo  Alto,  Calif. 
URL  http.y/altovisto.digitiicom 
Ownership:  Digitol  Eqwpmert  Corp. 
(NYSE:  DEO 

Main  Product:  AltaVista  Pubfo  Seorch 
aggregates  and  retneves  data  from  the 
Internet.  AltaVista  licensing  indudes  deols 
withTPiaelefonicaPublicidode 
Informocion,  Spain)  ond  YiAoo  Inc 


Recommendation  Systems/Collaborative  Filtering 


"The  players  in  this  emerging  market  consist  of  technol- 
ogy encblers,  which  are  attemphng  to  creofe  one-to-one 
morkehng  solutions,  and  full-service  solutions  providers, 
which  are  not  only  creohng  tools,  but  also  are  creahng 
the  platforms  to  allow  content  and  commerce  compa- 
nies to  take  personalizahon  and  its  opportunihes  to  the 
next  level. 

"In  this  pock,  firefly  has  the  best  vision  of  on  end- 
to-end  solution.  Since  its  incephon,  the  company  has 
shown  how  its  product  can  moke  communihes  come 
alive  in  a  number  of  settings.  In  particular.  Firefly  has 
shone  in  a  number  of  music  and  medio-rich  applications. 
[Among]  others  that  have  the  guile  and  experience  to 
stand  out  [is]  Alexo,  the  brainchild  of  the  folks  behind 
WAIS  and  NetPerceptions." 

— Allen  Weiner,  Dataquesi  Inc. 


Firefly  Network  Inc. 

Location:  Cambridge,  Mass. 

URL  www.firefly.net 
Ownership:  Private 
Main  Product:  Firefly  Catalog  Navigator 
peBonolization  software  captures  and  adds 
preference  and  general  interest-level 
information  about  client  businesses  and 
their  products  according  to  customers' 
personal  profiles. 

Net  Perceptions  Inc. 

Location:  Minneapolis 

URL  www.netperceptions.com 
Ownership:  Pnvofe 
Main  Product:  GroupLens 
Recommendation  Engine  tailors  o  client 
Web  site's  content  to  each  of  the  site's 
customers. 

LikeMinds  Inc. 

Location:  Son  Froncisco 

URL  www.likemindscom 
Ownership:  Private 
Main  Product:  WebSell  is  a  colloborafive 
filtering  ond  piedicrive  modeling  technology 
that  taps  customer's  behavior  data  based 
on  their  online  interactions.  It  then  identifies 
product  selections  that  have  the  "best 
chance"  of  being  purchased  by  targeted 
customers. 


Alexa  Internet 

Location:  Son  Francisco 

URL  www.alexa.com 
Ownership:  Private 

Main  Product:  Alexo  Internet  is  delivered 
vio  a  client  thot  ioob  like  an  additional  tooE 
bor  for  the  Netscape  or  Internet  Explorer 
browsers.  Alexo  displays  informotion  about 
the  site  a  user  is  accessing,  lets  users  vote 
on  site  quaFitY  and  offers  suggestions  about 
where  to  go  next  based  on  ttie  sites  this 
user,  or  others  like  him  or  her,  have  visited. 
Alexo's  information  is  generated  by  end 
users,  from  third-party  infoimatton  provide5, 
including  the  Encyclopaedia  Bntannico,  and 
hom  an  analysis  of  the  company's  copy  of 
the  entire  Web. 


Inc. 

location:  Son  Francisco 

URL  www.imana.com 
Ownership:  Private 

Main  Product:  Using  agent  technology, 
Imono's  Tolismon  software  enables  product 
ond  service  companies  to  offer  personalized 
information  delivery,  targeted  odvertising 
and  self-forming  online  communities. 


WiseWire  Corp. 

location:  Pittsburgh,  Pa. 

URL  www.wisewire.com 
Ownership:  Private 

Main  Product:  Using  wfnt  Itie  compony 
colls  smart  filtering,  WiseWie  for  Web  Sites 
integrates  into  a  business's  Web  site,  linking 
it  to  updated,  teol-time  informotion  specific 
to  a  business's  objectives  in  order  to  build 
community  around  shared  interests. 

Gustos  Software  LLC 

Location:  Laguna  Hills,  Cofif. 
URL  www.gustos.com 
Ownership:  Private 
Main  Product:  Gustos  Guide  is  a 
recommendation  engine  for  commerce  and 
od  serve5.  Using  collaborative  filtering, 
it  lets  users  rate  Web  sites. 
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Enterprise/Intranet  Software 


"While  Verity  raced  to  an  eorfy 
ud  OS  the  premier  search 
engine  for  enterprise  Web  sites, 
it  is  now  fodng  stiff  competition 
from  mdtiple  souree.  Moreover, 
with  the  Mlvent  of  Microsoft's 
Index  Server  as  a  core  compo- 
nent of  0  free  NT  Web  server, 
basic  hjikext  retrievol  is  rapidly 
'ecoming  a  commodity  service 
1  cyberspace.  Needed  ate 
network<entric  solutions  that 
offer  superior  precisiwi  and 
recJ,  with  a  minimum  of 
administrative  overiieod." 

— BeofheyBod, 
Patrida  Seybold  Group 


'^j  Visualization 

"Data  visuolization  is  the  latest 
emerging  trend  for  analysis  of 
large  volumes  of  detailed  doto. 
Current  grid  and  chart  presento- 
tion  methods  are  too  limited  for 
viewing  complex  result  sets 

1  many  different  variables, 
r  jn  adjunct  for  reporting  and 
data  mining,  visualization  tools 
give  the  end  user  the  obility  to 
comprehend  tens  of  thousands 
of  items  in  o  single  view.  We 
see  visualization  tools  and  com- 
ponents as  a  growing  morket 
f^wr  the  next  two  years.  [How- 

',]  it  is  too  eorty  to  see  a 
ciear  front-runner  in  this  emerg- 
ing sector." 
—don  Madavisli,  Meta  Gioup  Inc. 


Verity  Inc. 

LcKation:  Sunnyvoie,  Cdif. 
URL  www.veritv.com 
Ownenl^):  Public 
(Nosdoq:  VRIY) 

MomProdod:Sefl(d)'97"BO 
scoloble,  customizoble  knowledge 
retrievol  solution  for  ( 
and  novigoting  enterprise 
1  resources. 


MKrosoft  Corp. 

LMOtioR  Redmond.  Wash. 
URL  www.microsoft.com 
Ownenhiii:  Public 
(Nasdoq:  MSFT) 

Nhmi  Prodod:  Microsoft  index 
Saver  is  a  fulHext  retrievol 
engine  that  comes  included  with 
Miaosoft's  NT  Web  server 
software. 

Extalibur 
Technologies  Corp. 

Lixotion:  Vienno,  Vo. 

URL  www.excalib.com 
Owner^  Public 
(Nasdoq:  EXCA) 
Moni  Prodod:  Excalibur 
RetrievnIWore  is  a  set  of  took  for 
bcalding  text-based  knowledge 
retrievol  solutions  that  con  seorch 
through  information  located  m 
corporate  networks,  including 
intranets,  the  Web  and  other 
knowledge  repositories. 
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Fulcrum 
Technologies  Inc. 

LcKotion:  Ottowo, 
Ontario,  Conodo 
URL  www.fulcnjm.com 
OwnersUp:  Public 
(Nasdoq:  f  UlCF) 
Main  Prodoct:  Fulcnmi 
Knowledge  Netwodc  is  on 
inforniotion  retrieval  solution  for 
business.  It  provides  ogents, 
document  summorizotion  and 
intuitive  seorches  of  the  Web,  file 
servers,  intranets  and 
dotoboses. 

Open  Text  Corp. 

Location:  WoteHoo, 
Ontorio 

URL  www.opentext.cwn 
Ownership:  Pubfic 
(Nflsdoq:  OTEXF) 
Main  Product:  livelink 
Intronet  is  o  Webteed,  scalable, 
colloboiotive  knowledge  monoge- 
ment  application  for  intranets.  A 
component  of  the  fechnok)gy, 
livelink  Spider,  aowls  designated 
intranet  and  Intemet  Web  sites, 
retrieving  doto  that  is  then 
indexed  ond  catalogued. 
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Visual  Insights 

Location:  Nopervile,  III. 

URL  www.lucenf.com/ 
visuolinsighfs 

Ownership:  A  unit  of  Lucent 
Technologies  Inc.,  (NYSE:  LU) 
Main  Product:  Visual  Insights 
is  interactive  data  visuolization 
software  developed  at  Bell  Lobs. 
It  disploys  30  macroscopic  views 
of  doto  thot  con  be  manipulated 
by  use5. 

ThemeMedia  inc. 

Location:  Redmond,  Wash. 

URL  www.thememedia.com 
Ownership:  Privofe 
Main  Product:  SPIRIX  is  a  pro- 
totype intended  to  reploce  text- 
bflsed  retrieval  ond  onolysis  with  on 
interactive,  30  visuclizctiofrbosed 
system  ihot  uses  londscopdike 
interfaces  to  categorize  doto. 


Perspecta  Inc. 

Location:  Son  Francisco 

URL  www.perspecta.com 
Ownershqi:  Private 
Main  Product:  The 

SmartContent  System  is  a  scal- 
able development  platform  for 
navigation,  online  analysis  ond 
profiling  for  applications  using  30 
visual  interfaces.  It  facilitates  inter- 
oction  between  a  compony  and  its 
customers,  distributors,  suppliers 
and  partners. 

Inxight 
Software  Inc. 

Location:  Palo  Alto,  Cal'rf. 

URL  www.inxight.com 
Ownership:  A  Xerox  Corp. 
enterprise  company  (NYSE:XRX) 
Main  Product:  VizControls  dis- 
plays large  quantifies  of  infornio- 
tion as  graphic  or  visual  represen- 
fofions.  Users  view  thousands  of 
data  points,  documents  or  infor- 
mation objects  at  once  in 
various  graphical  ways. 


Personal  Library 
Software  (PLS)  Inc. 

LocotioE  Rockville,  Md. 
URL  www.pls.com 
Ownenl^p:  Joint  vennjre 
Netscape  raid  G£  Information 
Senrices 

Main  Prodact:  The  PIS  family 
comlmes  enterprise  odministra- 
tion  with  h*text  information 
retrieiAil  on  corporate  intranets 
and  the  Web. 

Sovereign  Hill 
Software  Inc. 

LocotHMC  Horfey,  Mass. 
URL  wwwLSOvereigihhill.com 
Ownen%  Private 
Mda  Proikct:  Sovereign  Hill's 
InOueiy  dstrtuted  intelgent 
seorchocc^  qwies  in  plain 
EngSsh  and  then  fMs,  retrieves 
and  sorts  doto  hom  opprapnate 
dotobosei 


Semio  Corp. 

Location:  Son  Mateo,  Calif. 
URL  wwwjemio.com 
Ownership:  Private 
Main  Proifaict:  SemioMop  dis- 
plays relationships  between  cate- 
gories of  information  in  order  to 
focus  a  search  on  relevont  infor- 
motion  and  sift  out  the  inelevant. 
It  con  answer  open-ended 
queries,  such  as  'Why  has  our 
company  lost  mad(et  shore  to 
the  comperitton?" 


Jungle 
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searching  iJiroughi  multiple  data- 
bases will  also  be  important.  A  truly 
robust  software  stanckrd  for  data- 
base access  could  shift  the  battle- 
ground from  the  Web  back  to  the 
desktop-software  market. 

At  the  same  time,  search  is  be- 
coming a  commodity;  its  future 
value  depends  increasingly  on  the 
associated  data  and  appHcations. 
Poor  search  technology  mamed  to 
excellent  content  or  appUcations 
beats  great  search  technology  ap- 
pUed  to  trivia. 

For  investors,  that  means  search 
companies  without  soUd  appUca- 
tions are,  or  should  be,  passe.  The 
likely  winners  will  be  the  big  com- 
panies that  traditionally  take  over 
commodity  businesses  and  small 
companies  that  marry  search  to  more 
focused  applications. 

Many  search  tools  could  follow 
the  path  of  desktop  publishing 
to  become  a  feature  in  every  appU- 
cation  rather  than  something  that 
stands  alone.  "We  are  going  to  see 
search  functionality  built  into 
many  applications,"  predicts  Micro- 
soft's Dumais. 

Though  the  early  winners  will  be 
those  that  make  the  fewest  demands 
on  end  users,  search  is  important 
enough  that  users  will  change  their 
behavior  to  get  better  results.  And 
as  Vannevar  Bush  indicated,  it  will 
change  how  people  think,  offering 
die  freedom  to  for^get,  to  remember 
and  to  know  at  will. 

"[Man's]  excursion  may  be 
more  enjoyable  if  he  can  reacquire 
the  privilege  of  forgetting  the 
manifold  things  he  does  not  need 
to  have  immediately  at  hand," 
Bush  wrote,  "with  some  assurance 
he  can  find  them  again  if  they 
prove  unportant."  ■ 

Jeff  Uboii  (jubois@netcom.com)  is 
an  independent  consultant  who  fias 
been  digging  tfirougfi  large  databases 
and  writing  about  the  Internet  for 
more  than  i  decade 
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Alexa  Debuts  Search  Engine  That  Learns  from 
Its  Users 


NEWSwatcn  Arcnves 


ternet  yesterday  introduced  a  new  search  engine  that  is  designed  to  learn  from  its 


users,  providing  Website  recommendations  based  on  analysis  of  anonymous  Internet  usage 
patterns. 

Working  in  conjunction  with  standard  Web  browsers,  the  Alexa  navigation  service  resides  on 
the  browser  as  a  toolbar  displayed  at  the  bottom  of  a  page.  The  toolbar  lets  users  know  where 
they  are  on  the  Web,  providing  detailed  Website  information  including  who  the  site  is 
registered  to.  how  many  pages  the  site  hosts,  how  frequently  the  site  is  updated,  and  how 
popular  the  site  is  with  others. 

The  toolbar  also  suggests  other  sites  to  surf  to  next,  and  alerts  users  as  to  whether 
colleagues  and  fnends  are  online,  enabling  real-time  online  communication. 

The  technology  behind  Alexa  is  based  on  the  overall  traffic  pattems  of  the  Web,  the  ecology 
of  links  between  sites,  the  content  of  Web  pages,  and  anonymous  usage  paths. 

Available  for  Windows  95  and  NT,  an  Alexa  beta  version  is  cun-ently  available.  Genera) 
availability  of  the  free  service  is  projected  for  late  summer. 

NEWSwatch   stories   for   Friday,   July   18,   1997: 

o  Centra  Unveils  Online  Learning.  Collaboration  Tool 

o  Company  Offers  Ne^.vQrk  Design  Tool  for  LANs.  WANs 

o  Alexa  Debuts  Search  Engine  That  Learns  from  Its  Users 

o  StjreSite  Announces  Free  Web  Page  Builder 

o  Cn/stal  Grachics  Uncorks  3D  Graphics  Development  Tool 

O  NCSA  Founds  ISP  Security  Consortium 
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You'll  see  other  explanations,  but  the  real  reason  that  weh 
ct'tT-ulsed  yesterday \'  We've  got  a  full  moon. 

Firing  Transfer  Protocol:  FTP  Software  of  .Andover.  Mass. 
has  laid  off  300  of  its  3O<0  employees  because  of  contmumg  losses. 

Deep  pockets:  Microsoft's  prntlts  rose  S9  percent  m  the  past 
quaner.  and  company  spokesmen  are  already  trymg  to  lower 
e.xpectations  for  the  future. 

Bio-tech  warriors:  A  new  class  of  antibiotics  has  been  created 
by  researchers  at  Stanford  University  and  Kosan  Biosciences. 
They  genetically  altered  the  popular  anubiotic  erythromycin  to 
amve  at  the  discovery. 

Ban  phone  tag,  too:  The  U.S.  Senate  has  decided  no  more 
computer  games  for  gc^  emment  employees. 

Feel  like  a  trip?  Microsoft  is  thinking  about  overseas 
expansion,  particularly  ;n  Dublin.  Ireland,  or  India.  The  high-tech 
labor  pool  is  growing  shallow'  around  Redmond  (or  in  the  U.S.. 
for  that  matter),  which  is  making  .Microsoft  thjnk  global. 

No-meddle  zone:  A  Philadelphia  city  council  member  asked  a 
Congressional  subcommmee  to  let  the  locals  to-x  the  Net  if  they  so 
desire.  .Also,  a  motion  picaire  mogul  wants  more  copynght 
protection  for  creative  '.vcrks  on  the  Net. 

Pushing  is  hard:  Netscape  pushes  hard  for  its  .Netcaster 
software  to  beat  out  Miccsoft's  IE.  There's  another  beta  version 
available  now  and  the  acrjal  release  is  set  for  next  month.  (BTW, 
there's  a  bug  fix  for  Communicator  that's  also  available  now.  too.) 

Filmstrips  of  the  '90s:  For  an  alternative  view  of  computers  in 
schools,  the  .Atlantic  Monthly  casts  a  skeptical  eye  on  the  rush  to 
wire  our  schools  -  especially  at  the  expense  of  art,  music  and 
physical  education. 

Cue  the  scary  music:  Sybase's  Mitchell  Kurtzman  takes  dead 
aim  at  Oracle  with  the  best  quote  of  the  morning:  "You  know  in 
those  horror  movies,  where  there's  always  a  scene  when  people 
think  someone  is  dead,  and  then  that  person  comes  back  to  life  and 
scares  the  hell  out  of  ever.  body\*  I  think  we're  going  to  scare  the 
hell  out  of  Oracle.  They  better  watch  out."  (.Another  mter-iew  with 
Kurtzman  popped  up  recently  in  Steve  Gillmors  column.) 

Living  in  the  future:  The  only  sure  thing  about  how  the 
technology  world  will  look  in  the  next  five  years  is  that  it  will  be 
nothing  like  the  present.  Networking  of  homes  is  just  a  start. 

Re-engineering  surfing:  .A  new  development  from  a  guy  who 
invented  W.AIS  and  was  the  lead  engineer  on  he  Thinking 
Machines  parallel-processing  supercomputer  project:  Why  not 
-uggest  like  -'.ites  for  people  via  a  browser  companion  application? 
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Internet  historian  creates  tool  to 
inform  Web  site  visitors 

By  Renee  Deger 

July  13,  1997  4:22  PM  PDT 

2DNN 

The  Internet's  historian  now  wants  to  be  its  guide.  Brewster 
Kahle.  the  man  who  is  attempting  to  archive  even/  Web  page 
for  history,  has  just  started  a  new  venture  that  would  inform 
Web  page  visitors  all  about  the  site  they  are  visiting  -- 
even/thing  from  who  'uns  it  to  how  many  hits  it  records. 

His  Alexa  internet,  a  search  engine  now  undergoing  a  limited 
public  beta  test,  features  a  tool  bar  on  the  browser  page  that 
lists  information  about  the  Web  site  being  viewed.  Information 
includes  who  runs  the  site,  how  often  it  is  refreshed  and  how 
much  activity  it  sees.  The  tool  bar  also  offers  links  to  related 
sites,  and  gives  details  of  who  links  to  the  site. 

Such  information  would  have  come  in  handy  to  recent 
visitors  of  Nasa.com.  Expecting  news  about  Mars,  users 
instead  were  greeted  with  a  pornographic  site.  The  NASA 
site  IS  actually  Nasa.gov,  and  Alexa  Internet  could  have  told 
users  the  other  was  owned  and  maintained  by  an  entity  other 
than  the  U.S.  government. 

The  porn  site  is  just  a  symptom  of  an  emerging  problem  with 
cyberspace.  Anyone  can  throw  up  a  Web  site  and  lure 
people  in,  and  not  all  sites  are  so  obviously  revealed. 

The  source  of  Alexa  Internet's  data  is  the  Internet  Archive,  a 
proiect  begun  by  Kahle  last  year  to  save  for  posterity  every 
one  of  the  50  million  or  so  Web  pages  that  exist.  The 
cataloging  information  is  now  shared  between  Internet 
Archive  and  the  15-month-old  Alexa  Internet.  Both  are  based 
in  San  Francisco. 


Srewster's  trillions 


Letting  Custprngrg  Dig 
Through  Ycuf  Data 


Seme  Mars  seekers  found 
porn  instead 


The  Net's  historian  himself  has  a  long  history  with  the 
Internet.  To  help  the  average  user  tap  into  the  collective 
knowledge  stored  online,  Kahle  helped  seed  the  online 
publishing  industry  in  1989.  He  founded  WAIS  Inc.,  an 
electronic  publishing  company.  It's  since  been  sold  to 
America  Online.  He  also  was  a  co-founder  of  supercomputer 
maker  Thinking  Machines. 

New  the  critical  mass  of  online  information  that  he 
encouraged  is  now  ready  for  a  new  navigation  tool,  one  that 


offers  qualitative  analysis. 

Kahle's  site  Thursday  began  registering  beta  users.  He 
wants  to  enlist  10,000  users  for  the  test,  but  believes  the 
company  ultimately  can  accommodate  millions. 

The  search  engine  serves  a  dual  role.  It  will  tell  users 
information  about  the  site  they  are  visiting,  but  it  also  records 
the  visitor's  travel  habits.  The  program  automatically 
communicates  back  to  Alexa  all  of  the  sites  its  users  visit  and 
in  what  order,  then  aggregates  the  data.  The  results  are 
incorporated  into  updates  of  the  program,  which  are  sent  to 
users  regularly  and  automatically,  so  users  can  view  the 
traffic  patterns  even  on  new  sites. 

Kahle  said  the  purpose  of  tracing  user  habits  is  so  that 
people  can  learn  about  the  Web  from  each  other,  albeit 
anonymously. 

"We're  not  trying  to  profile  who  [users]  are,  we  don't  care.  We 
want  to  aggregate  the  paths  on  the  Internet,"  said  Kahle. 


The  search  engine  will  be  free  once  it  is  finished  later  this 
year,  said  Kahle.  The  company,  seeded  with  about  $5  million 
from  a  Swiss  investment  company,  will  rely  on  advertising  for 
revenue. 


The  ads  will  rest  in  the  toolbar  and  will  change  according  to 
the  kind  of  Web  site  the  user  is  looking  at.  Kahle  said  he 
plans  to  forbid  companies  from  buying  spots  that  pop  up  on 
the  toolbar  when  users  visit  a  competitor's  Web  site. 
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Alexa  Debuts  Search  Engine  That  Learns  from 
Its  Users 


NEWSwatc-^  -.-:-■ 


/esterday  introduced  a  new  search  engine  that  is  designed  to  learn  from  its 


Lasers,  providing  Website  recommendations  based  on  analysis  of  anonymous  Internet  usage 
patterns 

Working  :n  conjunction  with  standard  Web  browsers,  the  Alexa  navigation  service  resides  on 
the  browser  as  a  toolbar  displayed  at  the  bottom  of  a  page.  The  toolbar  lets  users  know  where 
they  are  on  the  Web,  providing  detailed  Website  information  including  who  the  site  is 
registered  to,  how  many  pages  the  site  hosts,  how  frequently  the  site  is  updated,  and  how 
popular  the  site  is  with  others. 

The  toolbar  also  suggests  other  sites  to  surf  to  next,  and  alerts  users  as  to  whether 
colleagues  and  fnends  are  online,  enabling  real-time  online  communication. 

The  technology  behind  Alexa  is  based  on  the  overall  traffic  pattems  of  the  Web,  the  ecology 
of  links  between  sites,  the  content  of  Web  pages,  and  anonymous  usage  paths. 

Available  'or  Windows  95  and  NT,  an  Alexa  beta  version  is  currently  available.  General 
availabilir/  of  the  free  service  is  projected  for  late  summer. 

NEWSwatch    stories   for   Friday,   July   18,   1997: 

O  Centra  Unveils  Online  Learning.  Collaboration  Tool 

O  Company  Ctfers  .Me^vork  Design  Tool  tor  LANs.  WANs 

o  A:exa  Deoi..is  Search  Engine  That  Learns  from  its  Users 

o  Si.reSite  -Announces  ^^ee  Web  Page  Builder 

o  Cn/sral  Gr.^chics  Unnrks  .3D  Graphics  Develcoment  Ted 

O  'ICS-^  r 'uncs   S?  Security  Consortium 
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Alexa  Browser  Companion  Makes 
Searches  More  Certain 

by    Lisa    Moskowitz,    PC    World 

July  IS.  1997 

It"  browsers  only  knew  your  train  of  thought  when  you  scanned  a  site,  life  would  be  grand.  No  more  clicking 
back  and  forth  between  search  engines  and  sites,  hoping  that  one  of  your  13.369  search  results  will  offer  up  a 
sliver  of  useful  information.  Sigh  no  more,  dear  Netizens,  .\le.\a  has  amved. 

.Announced  July  17.  .Ale.xa  Internet  is  a  free  Web  navigation  service  that  learns  from  users'  collective  surfing 
patterns.  Working  independently  of— but  in  conjunction  with— your  browser.  Alexa  provides  recommendations 
on  where  to  go  next  based  on  a  site's  content  and  the  subsequent  paths  taken  by  previous  visitors  to  the  site. 

For  example,  when  you  go  to  a  site  about  camping,  the  .Alexa  toolbar  lets  the  system  know  where  you  are  so  it 
can  retneve  data  about  that  site.  In  seconds,  you'll  receive  facts  about  the  site,  like  how  many  pages  it  has  and 
to  whom  it's  registered.  You'll  also  be  able  to  consult  a  list  of  related  sites  on  subjects  such  as  the  best  camping 
grounds  in  California  or  where  to  buy  a  tent.  These  suggestions  pop  up  in  a  Where  To  Go  section  and  are 
based  on  Alexas  user  tracking.  Subject-sensitive  advertising,  Alexa's  revenue  source,  is  also  displayed  in  this 
section. 

.As  you  surf,  your  search  path  is  sent  anonymously  to  .Alexa  and  incorporated  into  their  database.  The  next  user 
who  comes  to  that  site  can  then  benefit  from  your  search  decisions.  Pnvacy  is  not  compromised  because  a  ngid 
svstem  discards  the  user's  identuv  before  anv  path  details  are  forwarded,  accordinsj  to  .Alexa  co-founder  Bruce 
Giiliat. 

If  the  browser  can't  locate  a  site  for  whatever  reason.  .Alexa  will  pull  up  the  most  recent  rendition  of  it  from  its 
archive.  .Alexa  aJso  has  an  instant  chat  feature  for  real-time  communication  with  other  .Alexa  users. 

;a  is  the  brainchild  of  Giiliat  and  Brewster  Kahle,  the  meta-data  king  of  Wide  .Area  Information  Sen/er 
V  ..  .AIS)  fame.  Kahle  invented  the  information  retneval  system,  which  fetches  documents  by  keywords  and 
ranks  them  based  on  the  number  of  times  the  keywords  appears.  Sound  familiar'!'  It  should.  Kahle  sold  W.AIS 
to  .America  Online  in  1995.  He  and  Giiliat,  who  worked  at  WAIS,  then  staned  on  their  new  project;  archiving 
the  Web.  "We  thought  we  should  preserve  our  digital  hentage,"  Giiliat  said.  "The  question  then  was,  what  can 
we  do  with  this  information  to  provide  a  service  for  Net  users'^" 

The  answer  was  .Alexa.  The  stan-up  was  founded  in  1996  and  accesses  an  archive  containing  more  than  5TB 
about  5000  gigabytes)  of  Web  data.  .Alexa  requires  less  than  10  percent  of  a  user's  bandwidl;h  to  operate. 
Giiliat  said. 

The  beta  version  of  .Alexa  is  currently  available  for  dowTiload  on  the  Alexa  home  page  for  the  first  10,000 
registered  users.  The  program  works  with  Netscape  and  Microsoft  browsers  and  requires  Windows  95  or  NT. 
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New  Service  Tracks  Web  Use 


Bv  JOHN  MARKOFF 


SAN  ?R.\SCISC0.  July  19  -  A5  a 
second  ger.erauon  of  Internet  searc.'^ 
cools  emerges,  a  company  called 
Alexa  Internet  has  started  a  service 
offering  users  more  information 
about  the  popularity  and  location  of 
tne  pages  they  view 

~-.e  ser--:ce.  wmch  works  with  In- 
terr.e:  br:wsers  offered  by  the  Net- 
scape Communications  Corporation 
and  the  Microsoft  Corporation,  fo- 
cuses on  the  relatively  new  idea  of 
assisting  users  in  navigating  the 
World  W;de  Web. 

The  Alexa  service  was  founded  by 
Brewster  Kahle.  who  was  one  of  the 
founders  of  the  Thinking  .Machines 
Corpcrracion.  and  by  Wais  Inc..  an 
information  retrieval  company  ac- 
quired by  America  Online  Inc.  The 
new  sep.'ice  has  close  ties  to  the 
Internet  Arcnive.  a  non-profit  organi- 
:at:cn  that  is  attempting  to  capture 
and  maintain  a  digital  snapshot  of 
the  World  Wide  Web. 

The  .Alexa  service  attempts  to 
monitor  tne  behavior  of  Internet  us- 
ers systematically,  the  better  to  re- 
turn inform.acion  on  the  popularity  of 


particular  'A-es  pages  i  he  idea.  .Mr. 
Kahle  said,  is  to  provide  Inte.-r.e: 
users  w::.^.  some  idea  of  the  ■•con- 
text'" of  the  page  they  view,  as  well 
as  tips  on  where  to  look  next. 

This  mrorm.ation  emerges  both 
from  the  preferences  of  otner  Alexa 
users  and  from  analysis  the  ser-'ice 
has  done  about  the  millions  of  linxs 
connecting  different  Web  pages. 

"We  re  trying  to  use  the  paths  that 
people  are  leaving  through  the  .Net  as 
an  indicator  of  what  is  good."  Mr. 
Kanle  said. 

Anotner  advantage  of  the  Alexa 
service  is  that  it  can  retrieve  "dead" 
Web  pages  that  have  been  captured 
by  the  Internet  Arcnive.  The  need  for 
such  capaoility  has  been  growing  as 
inform.ation  often  appears  and  then 
vanishes  quickly 

Because  tne  .-Mexa  service,  located 
at  ■j/'A-w  alexa  com.  is  free,  it  will  b« 
financed  by  advertisements  that  ap- 
pear inside  the  program's  display 
while  the  Internet  user  browses  the 
Web. 

.Mr  Ka.Tle  said  that  since  the  serv- 
ice monitors  individual  user  behav- 
ior on  the  Web,  it  has  been  designed 
to  guarantee  user  anonym.ity  in  or- 
der to  protect  privacy 
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IPJNOVATIONS    /    By    LAURIE    J.    FLYNN 

Alexa    Internet:  The  Search  as  a  Communal  Effort 


AN  FRANCISCO 
Nearchmii  the  V\'eb 


\lex(i   [luemer  wanti  to  know  about  your  most  successful  expetiences 


The  company,  based  in  San  Franci>^.-.\  this  week  announced  a  new  Web  navigation  service  that  analyzes  the 
pathb  or"  peoples  searches  in  order  U'  otter  suggestions  to  others  looking  for  similar  information.  The  idea  is 
that  while  search  services  can  provide  a  general  list  of  matches  based  on  keywords  and  other  elements, 
there's  really  no  substitute  for  expenence. 

"Paths'  ore  really  the  Holy  Grail  of  the  Web."  said  Brewster  Kahle.  chief  executive  of  Alexa  Internet  and 
one  of  the  onginal  developers  or  the  World  Wide  W'eb.  "We  think  the  paths  that  people  leave  are  the  real 

value,  " 


Alexa  arrears  as  a  toolbar  at  the  bottom  of  the  screen,  just  below  the  browser.  The  toolbar  lists  four  sites 
that  It  re.v'mmends  the  user  ':inj>.  to  from  the  page  or.  the  their  screen,  basing  its  recommendation  on  searches 
that  other  people  have  conducted  that  brought  them  :o  the  same  page. 

'We  want  to  know  where  else  did  they  go  where  they 
had  a  good  time,"  said  Kahle.  who  added  that  a  "good 
time"  is  defined  by  the  how  long  they  spent  at  each  link 
and  whether  they  clicked  through  further.  The  Alexa 
technology  records  people's  paths,  then  combines  that 
information  with  data  about  the  content  of  pages  to  come 
up  with  the  best  suggestions. 

"We  want  to  help  people  avoid  the  mistakes  of  others," 
Kahle  said. 

For  example,  someone  using  a  directory  service  to  find 
information  about  Ford  cars  would  inevitably  be 
directed  to  the  home  page  of  the  Ford  Showroom,  other 
otTicial  Ford  sites  and  whatev  er  other  sites  the  directory  deemed  relevant.  Once  at  the  Ford  Showroom  site, 
however,  an  Alexa   user  would  have  the  added  benefit  of  Alexa  's  list  of  suggested  links,  which  it  has 
compiled  based  on  the  paths  other  visitors  to  the  Ford  Showroom  have  taken. 

Kahle  descnbed  the  serMce  as  something  like  a  combination  of  .AltaVista,  considered  one  of  the  most 
powerful  and  comprehensive  search  engines,  and  F-rerlv.  an  intelligent  agent  that  makes  recommendations 
based  on  past  behavior.  Alexa  .  however,  is  almost  entirely  based  on  technology  developed  by  the 

^•-^mranv 


And  Kahij  >aid  that  the  technoji.igv  ha.s  at  least  one  major  advantage  over  directones  and  search  engines 
auuic   It    r-.en  rinds  the  small,  ^.'metimes  local  sites  that  the  major  search  services  sometimes  overlook. 
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rjailv  doubling  every  six  months 
[^'U'zh  time."  Kahle  said. 


and  our  information  shows  that  it  is  --  the 


"We  want  to  help  people 
avoid  the  mistakes  of 
others." 


Alexa  also  displays  information  about  traffic  to  the  site,  the 
number  of  pages  it  contains  and  the  speed  at  which  the  user  is 

connected. 

In  addition  to  listing  the  four  most  recommended  sites,  a  pop-up 
menu  lists  several  others  that  the  user  might  want  to  visit  and  it 
allows  the  user  to  add  other  sites.  Each  time  a  user  adds  a  link  to 
the  recommended  list,  Alexa  sends  an  anonymous  message 
back  to  the  company  to  become  pan  ot  its  database  of  paths. 


That  way,  the  more  people  who  use  Alexa  ,  the  better  a  navigation  service  it  becomes,  he  said. 

The  idea  for  Alexa  grew  out  of  another  of  Kahle's  projects,  the  Internet  .Archive,  a  effort  to  document  and 
store  Web  pages  and  Usenet  postmgs  for  a  histoncaJ  record  and  to  provide,  in  the  event  of  an  outage,  a  sort 
of  backup  system. 

In  Thursday's  outage  of  several  of  the  Internet's  name  servers,  for  e.xample,  the  Internet  Archive  e.xisted  as  a 
sort  ot  static  mirror  of  nearly  every  Web  page.  The  .Archive  currently  has  5  terrabytes  of  pages  (five  million 
megabytes). 

Kahle  is  no  stranger  to  developing  new  technologies  for  the  Web.  In  1989  he  invented  the  WAJS  technology 
for  searchmg  the  Web,  and  founded  W.AIS  Inc.,  an  electronic  publishing  company  that  he  later  sold  to 
.America  Online.  Before  that  he  helped  found  Thinking  Machines,  a  maker  of  supercomputers. 

Related   Sites 

Following  are  links  to  the  external  Web  sites  mentioned  in  this  anicle.  These  sites  are  not  part  of  The  .N'ew  York  Times  on  the 
Web,  and  The  Times  has  no  control  over  their  content  or  availability.  When  you  have  finished  visinng  any  of  these  sites,  you 
will  be  able  to  return  to  this  page  by  clicking  on  your  Web  browsers  'Back "  bunon  or  icon  until  this  page  reappears. 


•  AltaVista 

•  Firefly 

•  The  Internet  .Archive 
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Technology 

BellSouth  cuts  off  AirTouch  in  Brazilian  wireless  bid 


Steve   Ginsberg 

AirTouch  Communications  is  suil  loolong  for  a  Laun  Amencan  toehold  for  its  wireless  services  after 
BellSouth  outbid  the  San  Francisco-based  company  for  South  Amenca's  business  capital.  Sao  Paulo. 

At  S2.5  billion,  BellSouth's  winning  bid  was  four  times  higher  than  the  lowest  acceptable  bid  set  by  the 
government.  The  bid  was  also  four  times  higher  than  any  wireless  license  for  other  South  American  markets. 

.AirTouch,  which  finished  third  in  the  bidding,  is  proceeding  cautiously  in  South  America.  "We  were  not 
prepared  to  win  at  any  cost."  said  Kathy  Reinhart.  director  of  investor  relations.  "We  have  strict  investment 
hurdles." 

AirTouch  has  bids  in  for  five  other  Brazilian  markets.  It  won't  have  BellSouth  to  contend  with  because  of 
rules  restncting  license  holders  to  only  one  major  market  in  Brazil. 

With  Its  international  push  led  by  President  .Arun  Sarin.  .AirTouch  is  dominant  in  Europe,  and  the  cheap 
licenses  it  landed  early  on  mav  have  spoiled  the  company  for  an  era  of  inflated  license  fees.  In  1992, 
.\irTouch  got  the  license  for  Germany  and  didn't  have  to  pay  an  auction  pnce. 

BellSouth  is  staking  out  Latin  .\menca  after  entenng  .Argentma  in  1989.  It  is  already  operating  in  eight 
countries,  including  Brazil. 

.\lexa  Internet  lands  at  the  Presidio 

Internet  startups  are  usually  confined  to  gntty  South  of  .Market  Street  walkups  in  San  Francisco  or  viewless 
Peninsula  offices.  But  the  20  employees  at  .A.'le,xa  Internet  have  views  that  would  make  most  seven-figure 
CEOs  jealous.  .A.le.xa  is  among  the  fu-st  for-prorlt  companies  to  land  at  the  Presidio  National  Park  in  San 
Francisco. 

.Ale.xa.  a  spinoff  of  a  nonprofit  already  housed  at  the  former  army  base,  focuses  on  web  navigation.  'The 
companv  launched  its  web  site  this  week.  The  companv  is  the  brainchild  of  Bruce  Gilliat  and  Brewster 
Kahle.  ' 

E.A's  annual  report  is  fun  and  games 

.Annual  reports  have  long  been  not  only  dissenations  on  financials.  but  a  way  for  companies  to  market 
themselves.  Readers  of  Electronic  .Arts'  fiscal  1997  annual  report  won't  encounter  a  dollar  sign  until  page  13. 
The  report's  cover  states  in  stark  black  and  white.  "Warning:  This  annual  report  is  neither  interactive  nor 
entertaining." 

The  ne.xt  10  pages  work  to  dispel  that  notion,  showing  clips  of  dunking  hoopsters.  racing  cars  and  attacking 
helicopters.  .A  reference  to  an  E.A  game  on  the  page  opposite  the  basketball  clip  states  merely,  "It  doesn't  let 
you  dunk." 

E.A  can  afford  to  be  jocular.  The  company's  1997  numbers  were  good.  Revenues  grew  17  percent  to  $625 
million,  while  net  income  increased  to  S53  million--  a  30  percent  jump.  EA  gains  came,  in  part,  because  it 
guessed  nght.  Its  alignment  with  Sony  PlayStation  accounted  for  30  percent  of  net  revenues,  compared  with 
j^ust  9  percent  in  1996.  Net  revenues  from  Sega  dropped  from  28  percent  to  16  percent. 

E.A  IS  putting  Its  healthy  cash  flow  to  work.  The  company  e.xpects  to  complete  its  acquisition  of 
underachieving  Ma.xis  for  around  SI  25  million  in  stock  by  Sept.  1. 

Oakland  set  to  tap  Oracle  for  contract 

Oakland's  city  council  was  expected  to  select  Oracle  Corp.  of  Redwood  Shores  to  solve  the  city's  year  2000 

:om/san(ranc:iSCO/ stories/ 


software  problems. 

A  city  council  committee  selected  Oracle  over  Informix  for  the  S16  million  contract,  the  most  lucrative 
computer  contract  ever  awarded  in  the  Tmancially-strapped  city  The  software  developer  will  address  the 
city's  year  2000  problem  as  well  as  connect  its  1.700  desktop  PCs  so  that  depanments  can  communicate  with 
each  other. 

Several  smaller  Oakland  tech  companies  will  get  a  piece  of  the  Oracle  contract,  including  Lily  Hu  & 
Associates,  F2  Technologies  and  Cordoba  Corp. 

Deal  update:   Vanstar.  U.S.  West  make  buys 

In  its  bid  to  offer  more  extensive  technological  services  to  the  federal  government,  Vanstar  is  acquiring 
Virgirua-based  Systorex  in  a  stock  and  cash  deal.  Pleasanton-  based  Vanstar  will  use  500,000  shares  of  stock 
and  S46  million  in  cash  to  buy  its  former  partner,  Systorex,  a  S150  million  company. 

U.S.  West  has  become  the  single  largest  investor  in  San  Mateo's  Wire  Networks,  a  content  provider  of 
women's  issues  on  the  Internet.  The  size  of  the  investment  was  not  disclosed.  Wire  will  create  content  in 
several  cities  for  U.S.  West's  Divein  web  service.  Wire  has  raised  $8  million  since  its  1992  founding  and 
went  online  in  1995  with  Women's  Wire.  The  company  will  grow  from  45  to  around  75  employees  this  year. 
Recent  hires  included  chief  financial  and  technical  officers. 

®  /  997,  San  Francisco  Business  Times 


More  News  Columns 


Page;  2 

■!D://www,amcity.com;santrancisco/stori9S/ 

'2197/newscolumn4  ntml 


RETURN  TO  TECHSERVER:    NORM  \  I.  il  LOW-GRAPHICS 


build  a  network 

out  ot  thin  air. 


New  service  tracks  Web  use 
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Copyright    ®    1997    N.'^'.    Times    News    Service 

SAN  FR.\NCISCO  (July  21.  1997  04:39  a.m.  EDT)  -  As  a  second  generation  of  Internet  search  tools  emerges,  a  company  called  Ale.xa 
Internet  has  staned  a  service  offering  users  more  information  about  the  popularity  and  location  of  the  pages  they  view. 

The  service,  which  works  with  Internet  browsers  offered  by  Netscape  Communications  Corp.  and  Microsoft  Corp.,  focuses  on  the  relatively 
new  idea  of  assistmg  users  m  navigating  the  World  Wide  Web. 

The  .A.le,xa  ser-ice  was  founded  by  Brewster  Kahle,  who  was  one  of  the  founders  of  Thinking  Machines  Corp.,  and  by  Wais  Inc..  an 
information  retrieval  company  acquired  by  .Amenca  Online  Inc.  The  new  service  has  close  ues  to  the  Internet  Archive,  a  nonprofit  organization 
that  is  attempting  to  capture  and  maintain  a  digital  snapshot  of  the  World  Wide  Web. 

The  .A.!e.xa  ser-ice  attempts  to  monitor  the  behavior  of  Internet  users  systematically,  the  bener  to  return  information  on  the  populanty  of 
particular  Web  pages.  The  idea.  Kahle  said,  is  to  provide  Internet  users  with  some  idea  of  the  'context"  of  the  page  they  view,  as  well  as  tips 
on  where  to  look  next. 

This  information  emerges  both  from  the  preferences  of  other  .Alexa  users  and  from  analysis  the  service  has  done  about  the  millions  of  links 
connecting  different  Web  pages. 

-V'e're  trying  to  use  the  paths  that  people  are  leaving  through  the  Net  as  an  indicator  of  what  is  good,"  Kahle  said. 

Another  advantage  of  the  .AJexa  service  is  that  it  can  retrieve  "dead"  Web  pages  that  have  been  captured  by  the  Internet  Archive.  The  need  for 
such  capabiUtv'  has  been  growing  as  information  often  appears  and  then  vanishes  quickly. 

Because  the  Alexa  service,  located  at  www.alexa.com,  is  free,  it  will  be  financed  by  advertisements  that  appear  inside  the  program's  display 
while  the  Internet  user  browses  the  Web. 

Kahle  said  that  since  the  service  monitors  individual  user  behavior  on  the  Web,  it  ha5  been  designed  to  guarantee  user  anonymity  in  order  to 
protect  pnvacy. 
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Today's  News  and  Analysis 

•  Real  Standards  for  Streaming  Media 

Microsoft  and  Progressive  Networks  agree  ;:  partnerships. 

•  Mail:  Killer  Java  Add^ 

Lotus  and  Sun  prepare  Java  nnail  clients  for  :.-oss-platform  corporate 
use. 

•  Needed:  internet  Emergency  Plan 

A  nationwide  comcuter  glitch  underscores  tre  Net's  vulnerability. 

•  Life  After  50,  CYt;er?pace-styie 

Baby  boomers  (and  their  parents)  take  to  th-^  /Veb  in  record 
numbers. 


News.Scan 


Today's  Headlines  from  the  Mainstream  Pres 

Intel  has  settled  a  class  action  suit  brouc 
the  disclosure  last  year  that  an  error  in  testir 
to  overstate  the  speed  of  its  Pentium  microc 
10  percent.  Intel  denied  that  ;t  disseminatec 
information,  but  agreed  to  provide  a  S50  ret 
of  Intel  Overdnve  processors  to  customers  v. 
containing  a  12O-MH2  or  133-MHz  Pentium 
October  23,  1995,  and  January  5,  1996.  [N'' 
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The  internet  address  pool  is  dangerousi".   ow.  In  the  early 

1980s  the  Internet  was  crafted  with  a  system  of  4.3  billion 
potential  network  addresses-roughly  one  ac  :ress  for  every 
person  on  Earth.  Yet  today,  with  a  mere  50  r  ,lion  users  online, 
the  network  address  pool  is  running  dry.  Wit   out  new 
addresses,  the  net//ork's  expansion  will  soc    cease.  Scientists 
have  a  solution  in  the  wings,  but  as  changes  are  put  in  place 
over  the  next  two  to  five  years,  the  Net  is  lik--  y  to  be  fragile. 
[SJMN  7/21/97  p.  El] 

LinkExchange,  a  San  Francisco  start-up,  gives  small-time 
Web  sites  wider  exposure  by  creating  a  mar<etplace  in  which 
they  can  swap  advertising  space  on  their  own  sites  for  space  on 
others.  It's  one  of  the  few  examples  of  an  Intemet  company  with 
a  business  model  based  on  how  the  Internet  works  instead  of 
imposing  an  old-line  model  on  cyberspace.  [WSJ  7/21/97 
P.B7C] 

Alexa  Internet  is  a  new  Web  service  that  attempts  to  monitor 
the  behavior  of  Internet  users  and  reveal  the  populanty  of 
□articular  Weo  pages.  Started  by  Brewster  Kahle,  a  founder  of 
Thinking  Machines,  and  by  Wais,  an  information  retrieval 
company  acquired  by  America  Online,  the  service  uses  the 


NEW  PRODUCTS 


[PCs] 

PC  Labs  gets  its  hands  on 
AMD's  K6  processor. 

[SOFTWARE] 

Six  software  packages  for 
making  or  filling  out  electronic 
forms. 

Read  about  the  latest  3-D  home 
improvement  software  from 
Books  That  Work  and  Softdesk. 
[PERIPHERALS] 

Four  new  DAT  dnves  with  mass 
storage  capabilities. 

Prodtect  your  workstation  or 
sender  with  a  UPS. 

Hewlett-Packard's  multifunction 
OfficeJet  570  may  be  the  last 
pnnter  you'll  ever  need. 

[INTERNET] 

Use  the  Web  to  get  to  know 
your  customers. 

Build  communities  with  Creative 
Technologies's  Creative  Inspire. 


preferences  of  other  Alexa  users  and  analyses  of  the  millions  of 
links  connecting  different  Web  pages.  [NYT  7/21/97  p.D4] 

Excite  IS  expected  to  announce  today  Mailexcite.  a  new 
Web-based  electronic  messaging  sen/ice  that  gives  users  a  free 
e-mail  address  and  account.  Users  will  be  able  to  check  their 
e-mail  from  any  ccmputer  using  a  Web  browser.  [NYT  7/21/97 
P.D91 

CNN/Sports  Illustrated  yesterday  announced  SportSim  a 
free  fantasy  sports  Web  site  where  players  can  pit  their  dream 
teams  against  teams  concocted  by  other  players  [NYT  7/21/97 
p.DiO] 


[NETWORKS] 

Fbre  Chappql  connectivity  can 
help  eliminate  network 
twttlenecks. 

New  Gigabit  Ethernet  products 
for  ultrafast  networks. 


The  bigger  the  Internet  gets,  the  more  new  ■communities*  seem 
to  pop  up.  Thousands  of  sites  from  CBS  News  to  Yahoo 
sponsor  chats  every  day.  Others:  PeooleUnk.  Talk  Citv. 
Webchat  Broadcasting  System  and  The  Palace.  [NYT  7/21/97 
p.D6] 

Louis  Rossetto,  a  founder  of  Wired  magazine,  will  step 
down  as  CEO  of  Wired  Ventures,  but  wiil  continue  as 
editor-in-chief  and  publisher  of  Wired  and  chairman  of  Wired 
Ventures.  [NYT  7/21/97  p.D9] 

High-tech  dairy  and  cattle  farms  use  electronic  monitors 
and  notebook  computers  to  track  animals'  weights,  heights, 
food  consumption,  health,  milk  output,  and  beef  quaiitv  [NYT 

7/21/97  p. D4] 

Burning  Man,  a  five-day  celebration  in  Nevada's  Black  Rock 
Desert,  is  the  holiday  of  the  digerati.  [NYT  7/21/97  p.D4] 

Compiled  by  Carol  Levin  and  Leslie  Sonch 
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New  Guide  Offers  Net  Directions 
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The  next  guide  for  directing  Internet  visitors  to  useful  sites  may  be  other  Internet  visitors. 

The  inventor  of  the  pioneenng  Net  publishing  service  known  as  the  Wide  .\rea  Information  Server 
system.  Brewster  Kahle.  is  at  it  agajn.  This  time,  he  is  launching  a  service  that  tells  Web  users  details 
about  each  site  they  visit  and.  at  the  same  time,  recommends  other  sites  they  may  be  interested  in.  based 
on  analysis  of  surfing  patterns  of  other  visitors. 

The  new  service,  called  Alexa,  appears  inside  a  toolbar  that  attaches  itself  to  the  bottom  of  a  user's 
browser.  "Screen  real  estate  is  cenajnly  the  most  valuable  real  estate  in  the  world."  said  Kahle.  "We  have 
to  be  worth  more  than  real  estate  in  Tokyo." 
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To  achieve  such  worth,  Kahle's  Alexa  Internet  Corp..  based  in  San  Francisco,  has  been  building  up  a 
huge  archive  of  Internet  pages.  From  the  summer  of  1996  to  the  present.  Alexa  has  been  sucking  in  any 
page  that  has  appeared  on  the  Web,  building  up  a  repository  of  6  tnllion  bytes  of  Net  information. 

That  will  allow  the  company  to  track  usage  by  .\lexa-equipped  browsers  of  not  only  current  pages  on  'Jie 
Web  but  also  out-of-date  pages.  If  a  page  can't  be  found  at  its  onginal  address,  an  .\lexa-equipped 
browser  can  pull  it  up  from  .Alexa's  archive. 

But  to  succeed.  .Alexa  will  need  users.  Its  recommendations  will  be  based  on  traffic  patterns  as  captured 
by  .Alexa  browsers.  The  toolbar  will  display  sites  of  related  interest,  based  on  previous  users'  activities. 

The  bar  also  will  provide  pop-up  access  to  data  about  the  site  being  visited,  such  as  who  runs  the  site, 
where  that  person  or  company  is  located,  how  fresh  the  data  is  and  other  charactenstics.  That  data  will  be 
linked  to  vanous  other  sources  of  information,  such  as  maps  and  financial  filings,  that  will  allow  users  to 
better  judge  the  quality  of  the  site.  Ratings  of  the  sites  also  will  be  automatically  generated,  based  on  data 
collected  bv  the  .Alexa  browsers  from  users. 
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The  brow  ser  add-on  will  be  free,  with  registration  to  receive  beta  copies  beginning  last  Thursday  at  the 
company's  site.  .Alexa  Internet  will  support  itself  through  advertising,  linked  to  various  features  of  the 

toolbar. 

Kahle's  previous  venture.  W.AJS  Inc.,  was  sold  to  America  Online  Inc.  in  1995.  It  was  based  on  Wide 
.Area  Information  Server  technology. 

___    can  be  reached  at  www.alexa.com 
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New  search  tool  tracks  user's  path  through  Internet 


New  York  Times  News  Service 

22-Jul-1997   Tuesday 

As  second-generation  Internet  search  tools  emerge,  a  company  called  Alexa 
Internet  has  started  a  service  offenng  users  more  informauon  about  the 
popularity  and  location  of  the  pages  they  view. 

The  service,  which  works  with  Internet  browsers  offered  by  Netscape 
Communications  Corp.  and  Microsoft  Corp..  focuses  on  the  relatively  new 
idea  of  assistmg  users  in  navigating  the  World  Wide  Web. 

The  Alexa  service  attempts  to  monitor  the  behavior  of  Internet  users 
systematically,  the  better  to  return  information  on  the  popularity  of 
particular  Web  pages.  The  idea  is  to  provide  Internet  users  with  some  idea 
of  the  "context"  of  the  page  they  view,  as  well  as  tips  on  where  to  look 
next. 

This  information  emerges  both  from  the  preferences  of  other  Alexa  users  and 
from  analysis  the  service  has  done  about  the  millions  of  links  connecting 
different  Web  pages. 

.Another  advantage  at  the  .Alexa  service  is  that  it  can  retneve  "dead"  Web 
pages  that  have  been  captured  by  the  Internet  .Archive.  The  need  for  such 
capability  has  been  growing  as  information  often  appears  and  then  vanishes 
ickly. 

The  Ale.xa  service  (hctp7/ 

www.ale.xa.com)  is  free.  It  will  be  financed  by  advertisements  that  appear 

inside  the  programs  display. 
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News  Alert  Joins  ADSmart  Network 

[July  23,  1997]  New  York  City-based  News  Alert,  an  information  service  for  investors,  said  that  it  has  joined  ADSman.net 
and  will  begin  offering  custom  sponsorships  and  interstitial  advertising,  in  addition  to  standard  banner  advertising. 

News  Alert  claims  to  deliver  more  than  3.5  million  monthly  page  views,  and  68'^c  of  all  News  Alert  viewers  trade  stocks  at 
least  once  a  month. 

.Advertisers  can  now  sponsor  My  Portfolio,  News  .Alert's  personalized  portfolio,  as  well  as  personal  news,  live  links  to  online 
trading  sites.  Trading  comer  and  co-branded  content  on  News  Alert  partner  sites,  including  PC  Quote,  Individual  Investor 
Onhne,  Hoovers  Online  and  Zack's  Investment  Research,  Motley  Fool.  Edgar  On-line,  Wise  Wire,  Planet  Direct  and  Nequity. 
Rate  card  data  was  not  disclosed. 

.AJDSmart.net  is  an  advertising  sales  and  management  network  specializing  in  custom-designed  sponsorship  opportunities  for 
interactive  advertising.  .ADSmart  is  a  wholly  owned  subsidiary  of  CMG  Information  Services. 

Auto-By-Tel.   AllApartments   Sign   With   Classirieds2000 

Santa  Clara.  C.A-based  Clas^ir";eds:()0()  Inc.  announced  strategic  agreements  to  distnbute  the  content  of  Auto-Bv-Tel  and 
.AlKApanmenrs  through  its  classified  ad  network.  Financial  terms  of  the  deal  were  not  disclosed. 

■'.AllApartments  and  Auto-By-Tel  are  category  leaders  in  their  respective  fields  on  the  Internet,  and  we're  pleased  to  offer  them 
an  opportunity  to  reach  even  more  motivated,  targeted  users  through  the  CIassifieds2000  distnbution  nerwork. "  said  Sani 
El-Fishawy,  CEO  of  ClassifiedsZOOO. 

Classifieds2000  distributes  its  database  of  classified  advertising  content,  display  advertising  and  products  to  more  than  50  Web 
publishing  partners,  including:  search  sites  E.xcite,  Lycos,  InfoSeek  and  WebCrawler,  top  ISPs  AT&T  WorldNet,  Concentric 
Network  Corp.,  MindSpnng  Enterpnses  Inc..  Netcom  On-Line  Communications  Services  Inc.  and  WebTV  Networks  Inc.; 
and  popular  category  sites  including  Car  and  Dnver  and  Road  and  Track. 

US  Interactive  Launches  Campaign  for  Newspaper  Network 

New  York  City-based  digital  marketing  agency  US  Interactive  has  joined  with  the  newspaper  consortium  New  Century 
Network  (NCN)  and  the  latter's  NewsW^rks  Web  site  for  the  launch  of  NCN's  initial  consumer  advertising  campaign  online. 

US  Interactive  said  it  created  live,  Java  banner  ads  for  the  campaign.  Billings  were  not  disclosed. 

The  .NewsWorks  campaign  will  run  through  the  first  two  weeks  of  September  on  Excite,  Yahoo!,  Netscape,  Infoseek  and 
Lycos.  Creative  executions  were  developed  in-house  by  NCN. 

"This  campaign  will  aggressively  establish  the  NewsWorks  brand  in  the  minds  of  Web  users,"  said  Paul  Kessinger,  vice 
president  of  marketing  for  New  Century  Network.  "Not  only  are  we  targeting  known  Web  users,  but  we're  doing  it  in  the  two 
arenas  netizens  understand  most:  on  sites  that  they  frequently  visit  and  via  the  use  of  engaging  Java  banners  and  animated  gif 
banners. ' 


.Meclizine,   Mediconsult.com  Sign  Marketing  Deal 


Medi^.-'iKiilt  yiini  in  Boston  said  that  it  has  signed  an  exclusive  content  and  tnarketing  agreement  with  MediZine.  a  marketing 
company  that  reaches  consumers  through  an  education-based  bi-monthly  guidebook  senes  distnbuted  on  pharmacy  counters. 

MedicGnsult.com  will  be  MediZine's  exclusive  Internet  panner  and  will  develop  a  MediZine  branded  area  on  the 
Mediconsult.com  virtual  medical  center  Web  site.  Financial  terms  of  the  deal  were  not  disclosed. 

MediZine  will  include  Mediconsult.com  in  its  editonal  and  marketing  materiaJs  for  patients  that  are  currently  disuibuted 
through  more  than  8.50(3  retail  pharmacies  in  the  United  States. 

The  agreement  will  enable  Mediconsult.com  and  MediZine  to  leverage  advertising  on  the  Internet,  and  across  MediZine's  1.25 
million  heaJth-specific  guidebooks,  the  companies  said.  "Combining  both  print  and  VVeb  advertising  opportunities  will  generate 
greater  ROI  for  potential  sponsors  through  one-package  offering."  the  companies  sajd  in  a  statement. 

Mediconsult.com  is  a  "virtual"  medical  center  on  the  Internet  chat  offers  peer  reviewed  medical  information  on  more  than  50 
chronic  medical  conditions.  The  company  partners  with  health  care  providers  such  as  pharmaceutical  companies,  HMOs. 
hospitals,  non-profit  groups,  physicians  organizations  and  chantable  research  groups.  Revenues  are  denved  from  products 
and  service  sales,  Internet  marketing  consulting,  content  development,  content-based  advertising  and  educational  grants. 

I/PRO   Guarantees    10-Dav    .Audit  Delivery 

San  Francisco-based  Internet  Profile'^  Corp  iI/PRO>  said  that  the  company  will  now  offer  a  10-day  delivery  guarantee  to 
Nielsen  I/PRO  L/ALT)rT  customers,  surpassing  what  it  called  the  industry's  current  practice  of  delivering  audit  statements  up  to 
60  days  after  the  specified  audit  penod. 

Said  I/PRO's  president  and  CEO  Bradley  Rode:  "We  can  give  customers  and  Web  advertisers  confidence  that  reports  will  be 
there  when  they  need  them  while  still  having  the  reliable  data  that  they  have  come  to  expect  from  I/PRO." 

I/PRO  said  It  has  produced  more  than  5.000  audit  reports  since  I/.M/DIT  was  launched  in  1995. 

I/PRO  analyzes,  correlates,  and  validates  Web  activity  that  enables  marketers  to  understand  their  customers  and  site  usage. 
Clients  include  .\T&T  WorldNet.  CBS.  Chicago  Tribune  .  CMP  Publication  Inc.'s  TechWeb,  Infoseek,  Individual  Inc.. 
Nando.net.  Stan^ave.  Yahoo'  and  i'SA  Today  Online. 

New  Ad-Backed  Service  to  Monitor  Context  of  VVeb  Pages 

San  Francisco-based  .Aiexa  Internet  has  started  a  new,  ad-backed  service  offenng  users  more  information  about  the  popularity 
and  location  of  the  pages  they  view. 

The  company  was  founded  by  Brewster  Kahle.  who  was  one  of  the  founders  of  Thinking  Machines  Corp.,  and  by  Wais  Inc., 
an  information  retneval  company  acquired  by  America  Online  Inc.  according  to  The  \'e\\.  York  Times  .  The  new  service  has 
close  ties  to  the  Internet  .Archive,  a  nonprofit  organization  that  is  attempting  to  capture  and  maintain  a  digital  snapshot  of  Web. 

Alexa.  which  requires  a  download,  promises  "where  to  go  next '  suggestions  and  behind  the  scenes  facts  about  the  site  you  are 
visiting.  .Mexa  works  in  parallel  with,  and  independently  of  Web  browsers  and  is  displayed  as  a  toolbar  on  the  user's 
desktop 

The  service  attempts  to  monitor  the  behavior  of  Internet  users  systematically,  to  return  informauon  on  the  popularity  of 
panicular  Web  pages.  The  idea.  Kahle  told  the  Times ,  is  to  provide  Internet  users  with  some  idea  of  the  "context"  of  the  page 
they  view,  as  well  as  tips  on  where  to  look  next. 

"We're  trying  to  use  the  paths  that  people  are  leaving  through  the  Net  as  an  indicator  of  what  is  good."  Kahle  was  quoted  as 
saying. 

Another  advantage  of  the  Alexa  service  is  said  to  be  that  it  can  retneve  "dead"  Web  pages  that  have  been  captured  by  the 
Internet  .Archive. 

Cable  Channel  Site  Features  Old  TV  Commercials 

Banner  ads  ore  one  thing,  but  will  Web  surfers  tune  in  to  see  clips  of  old  TV  commercials'^  WTV  Networks  Inc.  thinks  so. 

Video  clips  of  classic  TV'  commercials  (can  you  say  "Where's  the  beef")  will  be  featured  on  a  newly  launched  combined  Web 
lUe  for  cable  channels  Nick  at  Nite  and  TV  Land,  according  to  Cable  World  magazine. 

Also  on  tap  are  program  listings,  descriptions  and  other  promotional  information  for  both  Nick  at  Nite  and  TV  Land.  The  site 
is  running  a  consumer  sweepstakes  to  draw  computer  users  to  its  four  electronic  arcade  games. 
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OUTSIDE  THE  BOX:  Once  upon  a  time,  mainframes  ruled  the  computing  world  and  a  group  o(  men 
and  women  at  a  Cambndge,  Mass..  outfit  called  Thinking  Machmes  decided  to  make  a  big,  fast,  fat 
computer  that  would  function  the  way  people  work.  Just  last  week  a  few  of  them  thought  they  might  be 
seemg  that  dream  finally  come  true. 

~~It  was  an  idea  10  years  in  the  making,"  said  Lew  Tucker,  who  now  manages  software  vendor 
relations  at  JavaSoft.  Tucker  was  one  of  many  Thinking  Machine  alumni  at  a  party  held  to  celebrate  the 
launch  of  a  new  service,  Alexa  Internet,  staned  by  Thinking  Machmes  founder  Brewster  KahJe.  ^~The 
saying  was,  "We  wanted  to  build  a  machine  that  would  be  proud  of  us.' "  he  joked. 

That  idea,  of  course,  failed.  But  some  of  the  theonzing  that  Thinking  .Machines  encouraged  --  stuff  about 
networks  and  anificial  intelligence  --  has  helped  make  the  Interent  and  the  World  Wide  Web  what  it  is 
today.  .A.nd  while  the  grand  idea  has  mutated,  the  computer  scientists' joy  in  being  panly  nght  is 
undiluted. 

The  geeks  were  so  happy  they  were  dancing  at  this  reunion,  cocktail  pany.  product  promotion  and  demo 

rolled  into  one  happy  event  at  the  well-lighted  and  militanly  sparse  officers'  club  at  The  Presidio  of  San 
Francisco,  near  Alexa's  offices. 

.AJexa  is  an  Internet  and  Web  search  tool  that's  trying  to  be  as  sman  as  the  people  using  it.  Instead  of 
letting  users  blindlv  2uess  the  links  between  Web  sites.  .Alexas  software  refines  a  search,  taking 
information  about  where  other  users  have  gone  to  point  out  areas  of  particular  interest  for  those  on  the 
same  path. 

"'This  is  Thinking  .Machines  crossed  with  the  Internet  and  the  Web.  'We  have  enough  users  and  enough 
data  to  make  som^ething  happen,"  Kahle  said.  ""We  can  live  to  replicate  the  Thinking  Machines  ideal." 
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2  Internet  Giants 
Won't  Show  at  Expo 

Microsoft,  Netscape  are  skipping  Chicago 


Chicago 

I:  the  Internet  industry  puts  on  a  con- 
ference and  Microsoft  and  Netscape 
Communications  don't  bother  to  show, 
■Aill  the  event  make  more  than  a  peep'' 
The  answer  is  yes  and  t,o. 

Organizers  of  Summer  Internet  World 
97.  which  starts  today,  insist  the  dynamic 
duo  of  the  Net  set  will  not  be  m.issed.  They 
expect  25. ceo  people  and  more  than  300 
companies  to  attend  the  show,  which  is  be- 
ing held  m  Chicago  for  the  first  time.  In- 
ternet World  also  is  presented  in  New 
Vor:<  in  the  winter  and  Los  Angeles  m  the 
spring. 

"It  will  take  about  a  year  to  establish 
this  show  in  the  Midwest."  said  a  spokes- 
m.an  for  Mecklerm.edia.  the  Westport. 
Conn.,  publisher  that  puts  on  Internet 
World.  "We  e.xpect  the  attendance  figures 
to  approach  the  New  York  and  L.A.  shows 
;50.COO  each)  next  year.  Rem.ember.  we  had 
to  m.ove  the  show  from  San  Jose  to  L.A.  to 
handle  the  big  crowds."' 

Seven  m.onths  ago.  Microsoft  and  Nets- 
cape were  com.peting  for  headlines  at  In- 
ternet World  in  New  York.  .And  both  also 
e.xhibited  at  the  spring  show  in  Los  .Ange- 
les. This  tim.e,  however,  they're  staying 
home,  having  concluded  that  Chicago 
amounts  to  overkill. 

Officials  of  both  companies  said  there 
are  too  many  Internet-related  shows  these 
days.  A  spokeswoman  for  Netscape  said 
the  Mountain  View  company  would  rather 
use  other  resources  —  its  online  site  and 
press  releases,  for  example  —  to  get  its 
m.essageout. 

Without  two  of  the  Net's  biggest  play- 
ers, the  show  has  the  feel  of  Chicago  with- 
out Bulls  merchandise  on  every  street  cor- 
ner 

.Mean'Ahi'.e.  other  com.panies  are  eager 
to  ::il  the  vacuum  and  grab  som.e  atten- 
tion: 

■  IBM  w'.ll  introduce  an  "Emergency 
Response  Tea.T."  of  experts  to  fend  off  on- 
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line  security  attacks  on  enterpr'-se  net- 
works. Big  Blue  will  also  offer  a  Web-host- 
mg  program,  for  sm.ailer  companies  and 
demonstrate  som.ethmg  called  the  Saa 
Francisco  Project,  a  Java  programming 
initiative. 

■  Oracle,  the  big  database  company,  is 
adding  payment  options  to  its  Internet 
com.m.erce  platform.,  making  it  easier  for 
com.panies  to  buy  software  and  services 
online. 

I  Sun  .Microsystem.s  will  showcase  the 
latest  in  Java,  its  programming  language  for 
the  Internet.  It  also  will  demionstrate  prod- 
ucts designed  for  electronic  commerce. 

■  .Apple  Computer  is  introducing  a  Ja- 
va version  of  its  popular  QuickTime  video- 
editing  program. 

■  World  .Access  is  taking  the  lens  cap 
off  KidCam.,  an  Internet  video-conferenc- 
ing system  that  allows  parents  to  monitor 
the  daily  doings  of  their  children  at  day- 
care and  preschoois. 

■  Alexa  Internet  is  announcing  a  free 
navigation  service  that  gives  users  advice 
on  where  to  surf  the  Web  for  specific  in- 
form.ation. 

■  WhoWhere?,  which  maintains  an  on- 
line directory  of  information  on  people 
and  businesses,  is  showing  a  free  e-mail 
service  called  MailCity. 

■  Maplnfo  is  hawking  an  Internet  map- 
ping program  that  allows  companies  to- 
display  the  geographic  locations  of  their 
.stores  and  offices  to  Net  users. 


Ub/U0/lSS7      ivS:  ib  4158Sbl'JS4 


ADTErjr'A   GPOUP 


The  Oakland  Press 
Pontiac,  MI 
July  23,  1997 


NEWS  BYTE 


Service  turns  page 
in  aiding  Web  users 

THE  NEW  YORK  TIMES 


SAN  FRANCISCO  —  A  3econd  gen- 
eration of  Internet  search  tools  is 

AJgxa  Internet  offers  a  new  service 
to  provi^eTliore  information  on  the 
Dopularity  and  location  of  World  Wide 
Web  pages. 

The  service  works  with  Internet 
browsers  from  Netscape 
Comniunication^  Corp.  and  Microsoft 
Corp. 

The  Alexa  ser.ice  has  close  ties  to 
the  Internet  Archive,  a  nonprofit 
org-anization  that  wants  to  maintain  a 
digital  snapshot  of  the  Web. 

The  new  ser-nce  seeks  to  monitor 
the  behavior  of  Internet  osers  system- 
atically and  assess  the  popularity  of 
particular  Web  pages. 

The  idea  is  to  inform  Internet  users 
about  the  "context"  of  the  pag«  they 
view,  as  well  aa  tips  on  where  to  look 
next,  said  Brewster  Kahle.  a  founder 
of  ThirJung  Machines  Corp. 

He  started  the  .AJexa  service  with 
Wais  Inc.,  an  information  retrieval 
compar^y  acquired  by  .\m,enca  Online. 

Aiexa  gleans  its  information  by 
studying  users'  preferences  and  ana- 
lyzing the  millions  of  links  connecting 
different  Web  pages. 

'We're  trying  to  use  the  paths  that 
people  are  leaving  through  the  Net  a« 
an  indicator  of  what  is  good,"  Kahle 
said 

The  Alexa  service  silso  retrieves 
'dead"  Web  pages  that  have  been  cap- 
tured by  the  Internet  Archive.  Such  a 
need  has  been  growing  as  information 
often  appears  and  then  vaxushes. 

The  service  —  located  at 
w\«rw.^exa.com  —  la  free.  It  will  be 
financed  by  advertisements  appearing 
inside  the  program's  display  while  the 
Int-smet  user  browses  the  Web. 

3;nce  the  service  n^onitors  inchvid- 
ual  user  behavior  on  the  Web.  Kahle 
said,  it  has  been  designed  to  guaran- 
tee user  aconvrnKy  and  protect  priva-  , 

'■'       . ._ -/I 
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•     Getting  even  more  persona):  Her 
name  is  Aiexa.  and  she  is  a  new  free  Web  navi- 
gation service  from  San  Francisco-based  Mexa 
Internet  that  learns  about  individuals'  likes  and 
dislikes  and  helps  users  determine  whether  or 
not  a  s/te  is  worth  visiting.  Alexa,  which  is  now  in 
beta,  also  offers  suggestions  about  where  a  user 
might  fmd  the  desired  information  based  on  an 
analysis  of  usage  patterns,  provides  content  and 
Site  links,  and  locates  pages  that  no  longer  are 
available  on  the  Web. 
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As  Go  Surfers,  So  Goes  Alexa 

by   Ciins     Oak.es 

5  ?3a"'    2-:  Jul  37  POT  The  Web  wants  to  tell  you  a  little  bit 
about  Itself,  At  least  that's  the  mission  of  a  new  service 
from  San  Francisco  start-up  Alexa  Internet-  A  free, 
advertising-supported   "Intelligent   navigation   service," 
Alexa  acts  as  a  browsing  companion,  providing  contextual 
site  information  via  a  slim  Windows  toolbar 

The  toolbar  software,  available  in  a  beta  version  from 
Alexa's  Web  site,  receives  and  presents  information 
independently  from  a  user's  browser  and  offers  a  range  of 
basic  data  and  site  ratings. 

"It's  a  navigation  service  that  gets  better  as  the  Web  grows 
and  as  there  are  more  users,"  said  Alexa  Internet  president 
and  co-founder  Brewster  Kahle,  who  also  founded  WAIS 
Inc.,  an  electronic  publishing  company  sold  to  America 
Online  in  1995.  "Other  people  have  found  good  things  and 
not  so  good  things,"  he  said.  "And  as  the  discernment  of 
users  gets  better,  Alexa  gets  better." 

The  "meta-data"  provided  by  Alexa  is  oriented  around  two 
questions;  Where  am  I  and  where  should  I  go  next?  The  first 
question  :s  answered  via  such  details  as  ratings  of  a  site's 
server  speed,  popularity,  and  content  freshness,  and  what 
paths  users  tend  to  follow  through  the  site.  The  second 
question  is  answered  with  a  list  of  links  to  related  sites 
others  have  frequented. 

In  drawing  conclusions  about  sites,  Alexa  bases  its  ratings 
and  recommendations  on  an  analysis  of  patterns  within  the 
Web  at  large  -  as  indicated  by  the  preponderance  of  links  to 
a  particular  site,  for  example.  It  also  builds  its  opinions  by 
tracking  the  usage  "paths"  of  other  Alexa  users. 

Jerry  f^ichalski,  managing  editor  of  industry  newsletter 
Release  1-0.  likes  what  he  sees  and  says  Alexa  has  no  direct 
parallel,  even  in  agent-oriented  sites  recommendation 
services  like   ""•■e-i\ . 

The  general  browsing  population.  Michalski  believes,  will 
take  to  it  when  it  sees  that  rather  than  an  overview  map 
provided  by  directories  like  Yahoo,  Alexa  is  a  map  with  a 
localized  structure.  "Wherever  you  happen  to  be,  it  gives 
you  the  major  roads  out,"  he  said.  "That's  pretty  useful." 

Alexa's  Web  analysis  is  based  on  a  massive  "copy  of  the 
Web"  -  an  archive  of  all  sites  that  have  existed  since  early 
1996,  Already  exceeding  5  terabytes  in  size,  the  company 
says  the  archive  is  updated  with  a  new  Web  "snapshot" 
about  every  60  days. 

But  Alexa's  techniques  raise  some  questions  about  the 
service's  mechanics  and  even  user  privacy.  For  one  thing, 
it's  a  service  that  gains  intelligence  as  it  is  used,  and  its 
ratings  can't  yet  draw  on  a  wealth  of  Alexa-tracked  "usage 
paths."  To  remedy  this.  Alexa  editors  are  manually  ensuring 
that   'top  Web  sites"  start  out  with  appropriate  ratings. 
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As  far  as  the  privacy  of  information  on  users'  paths  through 
the  Web,  Kahle  says  Alexa  doesn't  know  who  the  users  are  - 
only  what  they're  doing.  "It's  not  that  we  don't  exchange 
information  [on  a  user's  identity  and  behavior]  -  we  don't 
have  the  information  to  exchange." 


An  additional  feature  similar  to  Excite's  PAL  chat  service  or 
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AOL's  instant  messaging,  Alexa's  chat  system  can  tell  a 
browser  who's  online,  and  let  them  send  messages  in 
real-time. 

Forrester  Research  senior  analyst  Mark  Hardie  says  Alexa's 
success  depends  on  its  positioning  versus  directories  and 
search  engines.  Alexa  will  have  to  make  it  clear  that  it's  a 
navigation  hub,  educating  Web  users  that  'we're  not  about 
searching,  we're  about  helping  you  get  around." 

The  Web,  as  it  continues  to  explode  in  content,  may  require 
the  intelligence  of  a  mass  surfing  population  to  help  assess 
Its  content,  as  directories  can  only  tally  so  much.  "The 
richest  directory  that  we  know  of  points  to  less  than  1 
percent  of  all  Web  pages,'  Kahle  said.  That,  he  believes, 
leaves  a  lot  of  room  for  his  service's  approach. 
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Alexa 
Internet 
hopes  to 
change 
the  way 
you  use 
the  Net 


By  Z«ch»r>  Coil 


Building  116  on 
the  south  end  of  the 
Presidio  is  a  strange 
place  to  start  a  revo 
luUon.Thetwo-ston.' 
house  with  a  hipnwf 
and  shipslapsiding 
was  once  a  post  trad- 
er's store  and  later 
noncoinnuaaiuiicd 
officers' quarters. 

Alexa  Internet,  a 
small  Web  naviga- 
tion company, 
moved  into  thu 
budding  last  year  ;it- 
l<?rtheArm>  hanacd 
the  Picbidio  u\c.  Uj 
the  National  I'ark 
Service.  Now  the 
building,  bulk  circa 
1885,  IS  ground  zero 
for  the  company's 
new  Web  search  service,  u 
■  founders  say  could  change  ; 
people  use  the  Internet 

•The  Web  made  tveryon, 
lisher,"  says  the  company  ■=  I 
and  president  Brewster 
"Alexa  allows  ever,  one  to  be 
tor."  Or  as  co-founder  and 
manager  Bruce  Gillial 
"We're  the  first  and  oni\  'v^  • 
gation  service  that  accuall;. 
from  the  people  who  Use  ;l 

One  would  norniali;. 
that  sort  uflalk  as  bluster  c 
ly  from  a  company  with  nn  rt 
and  just  two  dozen  employe 
Kahle.adaLa  mining  special 
Gilliat  have  a  record  ot  m, 
success  out  of  companies  iiu 
pecLs  will  survive 

Alexa's  ser\  ice  appe.ir--  a 
gray  toolbar  that  can  be 
around  the  screen  sOiil' 
browse  the  Web  The  sir.  . 
to  users  and  vmII  t>e  .idvf'.;.- 
ported.  Currently  bcuiL;  •. 
those  whii've  dow  nllMl.!.-l• 
the  Web,  it  will  U.-ut!iu,dl  ■  : 
by  the  end  of  summer 

The  company  is  pi'siunii 
service asdn  alternali^c  U- 1 
directories  and  p«,pul,i:  ^^ 
gines  like  YahiMj  anil  h\'   '. 


dent  of  engine  rr- 
iry^forAIaa  In 
lernel.kneeU  be- 
fore the 

20  terahyud  igUal 
tape  machiru  •  in 
thecoripnny  's 
temperature  con- 
tmlled^yste'ris 
room.  Mcun  ling 
to  Smith,  thi  •  rna- 
chine  can  <;£i  >reall 
the  text  in  th  e  Li- 
brary of  Cor  tgress. 
with  rrjom  ^  ) 
spare. 


temative  they  claim 
will  make  it  easier  for 
users  to  find  what 
they're  loo  kjngfor. 

What  currerentiat«s 
Alexa's  s<  rrvice  i8  the 
way  it  Hn  ds  the  Infor- 
mation, h  istead  of  us- 
ing keywi  )rd  searches, 
as  most  ii  Mrch  engines 
do,  or  hin  ng  workers  to 
pick  the  t«3t  sites,  as 
many  d  i rectory  ser- 
vices do.  /  Mexa  watches 
where  Internet  users 
go  on  thf  r  Web  and  uses 
the  sum  •  )f  their  choices 
to  make  recommenda- 
tions. 

Brua  •  Gilliat 

domunai  .ralea  this  ca- 
pability   by  entenng  a 
site     on     surrealism. 
Clickini  J  on  a  "AiVhere  to 
Go"  icu  n,  he  pulls  up  a 
list  of   10  related  sittis, 
uding  Women  Su  rrealist  Artists, 
Dada.  Alberto  Giacom  etti  and  the  Sur- 
realists, and  The  Surr  ealism  Test  Cen- 
ter .^1  are  sites  thai,  users  linked  to 
from  the  onginal  site. 

These  are  sites  you  may  not  have 
found  with  a  search  engine,'  Gilliat 
said 

Some  of  the  seed  n  loney  for  .^exa  In- 
U-met  comes  from  tlahle  and  Gilliat's 
dst  venture,  WAIS  Inc.,  an  electronic 
publishing  company  they  sold  to  AOL 
for  S 15  million  in  li'<)5. 

The  pair  drew  aiuch  attention  na- 
tionwide last  fall  '.vhen  they  launched 
the  Internet  Archives,  an  ambitious  at- 
tempt to  record  thi  >  entirety  of  the  Web 
on  a  database  for  present  and  future 
scholars  and  reses  j-chers. 

The  vast  archiv  es  are  still  stored  on  a 
20-terabyte  digit  td  tape  machine  in 
Alexa's  tempers iture-controlled  sys- 
tems room.  The  machine  loots  like  a 
cross  between  a  jukebox  and  vending 
machine,  but  it .  .-iin  stcire  all  the  text  in 
the  Library  of  Congress  and  then  some. 
The  archives  also  have  a  coraxnerciaJ 
component  for /Uexa.  By  keeping  snap- 
shou  of  the  Wei )  that  are  updated  every 
SLX  weeks,  the  <  «mpany  can  find  pages 
that  have  disap  .pcared,  eliminating  the 
d readed  "404-  Mot  found"  error.  "Others 
\SeeDATA.B-7\ 
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New  revolution 
brewing  at  Presidio 

are  gathering  the  data,"  said  Z 
Smith,  vice  president  of  engineer- 
ing for  Alexa.  "The  unique  thing  is 
that  we  can  store  the  data  and  give 
it  back  to  people." 

Alexa's  service  also  features 
what  the  company  calls  "contextu- 
al navigation."  The  idea  is  that 
users  ought  to  know  a  little  back- 
ground on  the  site  they're  visiting. 

To  show  what  he  means,  Gilliat 
enters  a  site  on  a  travel  agency  in 
New  Jersey.  Knowing  nothing 
about  the  agency,  he  opens  a  box 
from  the  Alexa  toolbar  that  gives 
some  vital  details:  who  owns  the 
site,  how  many  people  have  visited 
il,  how  frequently  it  is  updated, 
how  many  Web  pages  there  are, 
how  fast  the  site's  server  is.  The 
company  also  has  a  button  where 
users  can  vote  whether  or  not  they 
liked  the  site.  "There's  no  editorial 


value  to  it,"  says  Gilliat.  "It's  just 
what  our  users  say." 

Alexa  users  can  also  click  on  a 
box  that  lets  them  send  messages 
to  friends  who  are  on-line,  similar 
to  services  like  ExcitePal,  iChat's 
iPage  and  AOL's  Buddy  List 

Kahle  said  the  company's  ser- 
vice can  be  used  along  with  other 
search  services.  "You  still  need 
some  medium  to  get  started,  like  a 
URL  or  a  search  engine,"  he  said. 
But  once  people  get  to  the  site  and 
start  using  Alexa's  sen.'ice  "they 
may  leave  and  not  necessarily 
come  back." 

Joe  Krause,  senior  vice  presi- 
dent of  marketing  and  co-founder 
of  Excite,  said  Alexa  may  be  the 
first  to  try  such  a  service,  but  that 
all  the  major  navigation  services 
have  the  technology'  to  offer  users 
similar  features. 

Krause  said  that  where  Alexa 
might  face  an  uphill  battle  is  in 
distributing  its  service.  "The  key  in 
our  business  is  distribution.  Get- 
ting eyeballs  for  your  service  is  the 


most  important  thing,"  he  said. 
Excite  has  relationships  with  Mi- 
crosoft, Netscape  and  America  On- 
line —  the  kind  of  partnerships 
Krause  said  Alexa  may  have  trou- 
ble negotiating. 

Still,  Gilliat  said  advertisers   ; 
have  expressed  interest  in  Alexa    ! 
because  it  allows  them  to  target    | 
users  by  their  interest.  "Mobil,  for    : 
example,    can    say,    'Wherever    I 
t>iere's  automotive  site,  put  our  ad 
up  there',"  he  said.  "The  ads  are 
content-sensitive  to  what  you're 
locu'sing  at."  j 

The  company  is  allowing  10,000    i 
people  try  the  beta  version  of  the    j 
servi4;e.  "We  have  to  get  feedback    I 
and  users  and  make  sure  our  ser- 
vice works,"  said  Gilliat.  ■   ' 

And  if  the  company  is  success- 
ful, vrill  the  founders  sell  again?  j 
"We  did  WAIS  and  sold  it  for  i 
$15  million."  he  said.  "We  want  to 
do  thin|:s  differently  this  time.  We  , 
don't  v/ant  to  sell  it  right  now. 
Maybe  ^ve'll  have  an  IPO  at  a  later 
time." 
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New  service  tracks 
Web  use 

SAN  FP^.NCisco  —  As  a  se£-: 
end  generation  of  inferret  search 
toois  emerges  a  company  caite 
Aiexa  internet  rras  startec  a  serv^c 
offenng  users  rrxjre  informatK^'' 
ab-out  the  pcpufanry  and  iocatfont'* 
ttie  pages  they  view. 

Trte  si&CAce.  whicri  wcrte  ^ttftT- 
Irtterret  brcArsers  cffer9<i  tv  ;'..' 
Netscape  C-ommuruancr^,  Cotp^ 
and  Vlicrosoft  Ccru  ,  ?ocusss  on-tJ 
r9fat]ve,'Y  new  idea  ot  ass-sfiig;,';, 
users  in  navigating  the  Worftf^^ 
Web. 

ThaAfexa  senw©  stTenT^^s? 
monitor  rtie  behavior  of  inierDeiS 
users  systamalicafly,  ^Te  bets 
rstum-  informatiorii  orr^  i 
of' parficufar  Web  pages--Tf»^ 
^isto  pfovicfelntemet  users,  irt^:^ 
seme  idea  of  the  "contexf  ' 
.  page- they  viev^  as  wefl  as  trp^ 
-where  to  rool<  nexL  '■';^-  -""-_ 
'  ■'  This"ihforTriatiori  ©marges  boUv'^ 
,  itom  the  preferences  of  otfw^ 
users  and  from  analysis  the ; 
has  done  about  tfie  miJlJons  of  H 
.  connecting  different  Web  pages. 

Because  the  Aiexa  ser/ice.  foof 
ed  at  wvAv.atex^.com,  is  rree,  ft : 
will  be  finartced  by  advertis«mer 
that  acpear  inside  the  program's:"' 
disptay  whife  the  Internet 'jsef    - 
Drowses  the  Web.  -;'i 

{JOHN  MAHKOFF/ 
N.  Y.  Times  News  Service)       -'■ 
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Net  growth 

virtually 

exploding 

■  The  swelling 
digital  bubble  of  Net 
and  Web  use  in  no 
danger  of  bursting, 
say  enthusiasts 

Bv  Jon  Swartz 

Sar  rnncsco  C^no/iicie 

CHICAGO  —  The  Internet  s 
meteoric  rise  is  far  from  over 
The  party  is  just  beginniiig. 

Those  were  the  unfettered  ob- 
servatioris  of  revelers  at  the 
Smnmer  Internet  World  97  show 
here  this  week.  They  say  the  2- 
year-old  industry  is  not  even 
close  to  peaking  in  populanrr'  — 
in  fact,  its  on  the  cusp  of  an- 
other boom  period. 

"We  see  no  end  m  sight."  sa:c 
Aian  Taffel,  vice  president  or 
marketing  and  business  devel- 
opment at  Uunet  Technologies 
Inc..  a  Fairfax,  Va.,  Internet  ser- 
vice provider.  "This  is  just  the 
start  of  a  long-term  phenome- 
non." 

The  biggest  opportunity  in- 
dustry observers  said,  is  in  bi? 
business,  -Arhere  many  corpora- 
tions still  rely  on  leased  lines 
and  public  data  services  for  com- 
munications. Many  are  just  now 
pornmg  to  the  Internet,  after 
bdlkmg  because  of  concerns 
over  online  security  and  quality' 
of  ser/ice. 

Others  sajd  the  integration  of 
"killer  apps"  —  software  pro- 
grams that  improve  video,  sound 
and  fax  services  on  the  Net  — 
•A-iIl  fuel  even  greater  growth  ir. 
tne  corporate  and  consumer 
markets. 

In  a  keynote  speech  or. 
Wednesday  Novell  Inc.  chief  ex- 
ecutive Erie  Schmidt  said  the 
Wf»h    a-i!l    nrovifip    rnTnn;in^p- 


Oiaries  BenneWAP 


Rick  Jackson  of  Digrtal  Knovledge  Assets  works  at  a  bank  of  conv  ( 
puters  at  the  Summer  Internet  World  '97  on  Wednesday  in  Chicago.  '^ 
E-commerce  is  the  buzzword  at  this  year's  show 


with  a  'serv-ice  model '  for  "ir.- 
formation  utilities  ' 

.\il  told,  Lhe  momentum  of  ->.e 
Ne:  appears  to  be  'ar.stoppab.^. 
iaid  show^oers  at  Lnternet  Worid. 

'The  rite  of  change  on  'he  In- 
ternet is  like  dog  years.  For 
p'.-t?rv-  ye^r  in  the  PC  market  -.t  i 
like  seven  on  the  Interne:,'  said 
Bre-'A'sterKahle.  chief  e.xecutive 
of  San  Frar.ciico-baied  .\lexa  In- 
ternet, one  of  300  exhi'oTtors  at 
tb.e  shew.  "It  is  a  bl3tenn?  pace.  ' 

'.■•-net  has  come  v.;}  with  :*: 
O'ATi  phrase  to  describe  the  w.cr- 
spread  acceptance  ^jf  the  N'er 
'L"'ernet  Law" 

Based    on    '  Moore  j    Law 
w'nich  states  chat  the  perfor- 
mance of  1  personal  com.pur^r 


the  same  price.  "Internet  Law" 
claims  that  'he  data-transmis- 
sion :apscity  'iem,2nds  of  con- 
5-u,mers  aoucies  :very  three  to 
fo'-ir  months 

•Qur  network  has  to  ?row  by 
a  factor  cf  10  ever^'  year."  Taffel 
said  What  o»h?r  industry  Is 
even  close  ro  that  nte'^" 

The  internet  5  rapid  growth 
has  ■'■^::^,'i  industry  heavy- 
weishr?  ^acn  as  IBM  Corp.,  Mi- 
"i^snr.  C  7rr  ^,nd  Hewlett- 
ric.-i'^ci  ■:-  -0  scramble  to 
w-3ve  :.".e  '-V^o  mto  their  sprawl- 
ing'3'.o.."^s?  jceraticns. 

Thi  Ln--^rn^t  evolution  is  cr&- 
a'lnc  1  tis_ness  revolution." 
L-jiu  ur:-  Hmm.^n,  Internet 
b U-- : n  =  -  •  zz  v s . o p m*: nt  mana ger 
St  Filr  A-,-ois^'lHP 


Jiftf'ribidya 


^^JTE^J^JA  gpoup 
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Charles  Liberty,  a  marketing  in  iivager  for  >lassachu*«tt«-based  Digital  Equipment  Corp.,  eiyoyi 
the  Qrst  day  of  Internet  Worlc  Summer  '97,  the  industry  trade  show  at  Chicago's  McCormicX 
Place.  'This  year  and  next  are  probably  the  years  for  Internet  commerce  to  take  off,"  he  said 
of  an  E^ontmerce  boom  that  -^tperts  say  may  generate  $6  billion  In  revenue  annuaiiy  by  the 
year  2000. 

E-commeice  about  to  launch? 


By  JON  SWARTZ 

S«n  Fr»ncl»co  Cn/enieio 


CHICAGO  —  If  the  Interrvet  industry  puts  on  a 
>:onference  and  Microsoft  ar.d  Netscape 
Communications  don't  bother  to  show.  wiU  the 
evtnt  make  more  than  a  peep"" 

The  ans'.ver  is  yes  ar.d  no. 

Organizers  of  Summer  internet  Worfd  97,  that 
started  Wednesday,  insist  the  dynairuc  duo  of  the 
Net  set  will  not  be  missed.  They  expect  25,000 
people  and  more  than  300  companies  to  atrend 
the  show,  which  is  being  held  in  Chicago  for  the 
Hrst  time.  Internet  World  also  is  presented  m 
New  York  in  the  ^^lnter  and  Los  .\ngeles  in  the 
spnng. 

*Il  will  tajce  about  a  year  to  estabhsh  this 
show  in  the  Midwest*  said  a  spokesman  for 
Mecklermedia,  the  Wostport.  Conn.,  publisher 
that  puts  on  Internet  World  "We  expect  the 
attendance  figures  to  approach  the  New  York 
and  La  shows  (50  000  each)  next  year 
Remember,  we  had  to  move  the  show  from  San 
Jose  to  La.  CO  handle  che  big  crowds." 

Ser-en  months  ago,  -Microsoft  and  Netscape 
were  competing  for  headlines  at  Internet  World 
m  New  York.  And  both  also  exhibited  at  the 
spnng  shew  in  Los  Angelei.  This  ame,  hov^-ever 
they're  staying  home,  hatnng  concluded  that 
Chicago  amounts  to  overkili. 

OtTiaais  of  both  companies  said  th.ere  are  too 
many  Internet-related  sho-w^  these  days.  A 
spokeswoman   .'or  Netscape  ssad   the  .Mountai.T 


View  company  would  rather  -ase  other  resources 

—  Its  online  site  and  press  releases,  for  example 

-  to  get  its  message  out 

Without  two  of  the  Net's  biggest  players,  the 
show  has  the  feel  of  Chicago  without  BuHs  mer- 
chandise on  every  street  comer. 

Meanwhile,  other  companies  are  eager  to  fill 
the  vacuum  and  grab  some  attention; 

•  IBM  will  introduce  an  'Emergency 
Response  Team'  of  experts  to  fend  off  onlir« 
security  attacks  on  enterprise  networks.  Big 
Blue  will  also  offer  a  Web-hoscmg  program  for 
smaller  companies  and  demonstrate  something 
called  the  San  Francisco  Project,  a  .Java  pro- 
gramirung  uutiadve. 

•  Oracle,  the  big  database  company.  LS  adding 
payment  options  to  its  Int^m.et  c-jmrrerce  plat- 
form, making  it  easier  for  companies  to  b'jy  soft- 
ware and  ser-lces  orUine. 

•  Sun  Microsystems  will  showcase  the  latest 
in  Java,  its  pn)gramir,g  language  for  the  Internet. 
It  also  will  demonstrate  products  designed  fisr 
electronic  commerce. 

•  Apple  Computer  is  introdudr^  a  Java  ver 
sion  of  Its  popular  QuickTime  video-editing  pro- 
gram. 

•  World  Access  is  taking  the  lens  cap  off 
KidCam,  an  Internet  vjdeo-corierencing  system 
that  allows  parents  to  morutor  the  daily  doings 
of  their  children  at  day-care  and  preschools. 

«  Aiexa  Internet  is  announcing  a  free  naviga- 
tion service  that  gives  users  adMce  on  where  to 
surf  the  Web  for  specific  information. 
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Officials  see  no  end  in  sight 
for  the  popularity  of  Internet 


By  Jon  Swartz 

Seiri  Franasco  Chronicte 

CHICAGO  —  The  Iftternct's 
meteoric  rue  is  far  from  over 
The  party  15  Just  beginning. 

Those  were  the  unfettered  ob- 
servatjons  of  revelers  at  the 
Summer  Internet  World  97  show 
here  this  week. 

They  say  the  2-year-old  indus- 
try is  not  even  close  to  peaking 
in  popularity  —  in  fact,  it's  on 
the  cusp  of  another  boom  period 

"We  see  no  end  in  sight,"  said 
Alan  Taffel,  vice  president  of 
marketing  and  business  develop- 
ment at  Uunet  Technologies  Inc., 
a  Fairfax,  Va,  Internet  service 
provider.  This  is  just  the  start  of 
a  long-term  phenomenon." 

The  biggest  opportunity,  indus- 
try observers  said,  is  in  big  busi- 
ness, where  many  corporations 
still  rely  on  leased  lines  and 
public  data  services  for 

communications. 
Many  are 
Just  now  t\im- 
ing  to  the  Inter- 
net, after  balk- 
ing because 
of  concerns 
h-^-r'?T^^5^-5%<-..over  online 
^^^nJs:^^^^  security 
■^">vV-r^^      and  quality  , 
of  service. 
Others  said 
the  integration  of 
"kiHer  apps,"  —  software  pro- 
grams that  improve  video,  sound 


and  fax  services  on  the  Net  — 
will  Aiel  even  greater  growth  in 
the  corporate  and  consumer 
markets. 

In  a  keynote  speech.  Novell 
Inc.  chief  executive  Enc  Schmidt 
said  the  Web  win  provide  com- 
panies with  a  "service  model" 
for  "information  ublities." 

All  told,  the  momentum  of  the 
Net  appears  to  be  unstoppable, 
said  show  goers  at  Internet 
World. 

The  rate  of  change  on  the  In- 
ternet is  like  dog  years.  For  ev- 
ery year  in  the  PC  market  it's 
like  seven  on  the  Internet"  said 
Brewster  Kahle,  chief  executive 
of  San  FrancLSCo-based  Aiexa  In- 
ternet one  of  300  exhibitors  at 
the  show    'It  is  a  blistenng 
pace." 

Uunet  has  come  up  with  its 
own  phrase  to  describe  the  wide- 
spread acceptance  of  the  Net 
"Internet  Law." 

Based  on  'Moore's  Law," 
which  states  that  the  perfor- 
mance of  a  personal  computer 
will  double  every  18  months  at 
the  same  pnce,  "Internet  Law" 
claims  that  the  data-transmission 
capacity  demands  of  consumers 
doubles  every  three  to  four 
months. 

'Our  network  has  to  grow  by  a 
factor  o(  10  e\-ery  year,"  Taffel 
said    'What  other  industry  is 
even  close  to  that  rate''"  " 
The  Internets  rapid  growth 


has  forced  industry  heavy- 
weights such  as  IBM  Corp., 
Microsoft  Corp-  and  Hewlett- 
Packard  Co.  to  scramble  to 
weave  the  Web  mto  their  sprawl- 
ing business  operations. 

The  Internet  evolution  is  cre- 
ating a  business  revolution,"  said 
Dave  Hmman,  Internet  business 
development  manager  at  Palo 
Alto-based  HP 

To  harness  the  runaway 
growth,  HP  is  in  the  process  of 
Web-iiying  its  business  model  so 
that  its  pnnter,  PC  and  server  di- 
visions are  working  In  sync  with 
its  Net  activities.  The  company  is 
also  stepping  up  efforts  to  mar- 
ket Internet  products  and  ser- 
vices, Hmman  said,  althou^  he 

declined  to  say  how. 

Another  indication  of  the 
growth  of  the  Net  is  that  spend- 
ing on  Web  ads  more  than  dou- 
bled in  the  first  half  of  this  year, 
to  $217.3  miUion.  according  to 
the  newsletter  Electronic  Adver- 
tising &  MarteqDlace  Report 

Even  the  Internet  trade-show 
industry  aeems  to  be  thriving. 
Mecklermedia,  the  Westport 
Conn.,  OTTianizer  of  this  week's 
show,  announced  plans  to 
launch  a  Web  conference  for 
consumers  next  Januaiy  in 
Boston. 

It  now  runs  four  Internet 
shows  a  year  —  twice  as  many 
annually  as  Comdex  and  Mac- 
world Expo. 
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Alexa  Internet  —  One  of  the  better  tools  we've  seen  for  Web  iurfing  lately  is  Alexa  -  a  browser  aid  now  in  beta.  AJexa 
gives  users  a  he^ds  up  on  how  many  pages  are  on  the  site,  who  owns  it,  and  a  relative  gauge  of  bow  popular  a  site  is  by 
how  many  pages  oiit  on  the  Web  refer  to  it.  In  addiocn,  if  you  go  to  a  link  that  leads  to  a  dead  page,  chances  are  Alexa 
may  have  recorded  a  copy  of  it.  Finally,  Alexa  serves  an  agent  function;  Because  it  tas  access  to  a  data  mine  of  path 
information,  it's  able  to  make  suggestions  based  on  the  'T)caten  oaths"  left  by  other  surfers  who  ve  hit  the  same  sites. 
Finally,  users  can  rate  sites  diemselves  and  communicate  Mxh  omer  Alexa  users.  The  company  jiist  started  trickling  out 
betas  of  Its  software  last  week,  and  SO  far  we  lilce  iL  It  allows  site  seers  to  get  their  bearings  on  the  Web  in  what  struck 
U5  as  a  new  and  user-etnpowenng  way. 
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Alexa  Keeps  Surfers  On  Beaten  Path 


By  Tom  SteLnert-Threlkeld 

With  640,000  sites  and  counnng, 
the  Web  remains  a  tangled  knot  ot 
ijitbrmation  for  manv  visitors. 

In  the  eves  ot  Brewster  KarJe.  thie 
rhitest  wav  to  get  users  where  the'/'re 
going  mjv  be  to  use  a  reiativeiv 
untapped  set  of  Net  e.xperj 

—  other 'A'eb  ••isitors. 

Kahie      helped     star: 
Thinking  Machines  Corr. 

—  which  riffured  out  a  w^v 
to  tie  togedier  thousands  of 
processors  to  create  power- 
hil  computing  engines  — 
and  invented  the  pioneer- 
ing N'et  pubiishiing  service 
known  as  the  ^v'ide  .\rea 
Informanon  Sep/er  s^vsterr.. 
Now  he  has  turned  h:s 
acrennon  :o  creating  a  ne'.s- 
means  of  adding  relevance 
and  intelligence  to  naviga- 
tion of  the  ^Veb. 

He  is  launching  a  sep.ice  to  offer 
Vv'eb  users  details  about  each  site  thev 
'visit  and  to  recommend  other  sites, 
based  on  analvsis  of  the  surrlng  pat- 
terns of  other  visitors. 

The  ne'.v  service,  called  .^de.xa. 
appears  mside  a  tool  bar  th.at  attacne^ 
itself  to  the  bottom  of  a  user  ^  browser. 

■'Screen  real  estate  is  certain]'.'  die 
most  valuable  real  estate  in  the  '.vorld." 
Kahle  said.  "Vv'e  have  to  be  '.vortr. 
more  than  real  estate  in  Tok->.o." 

To  achieve  such  worth,  Kahle'^ 
.Aie.-ca  Internet  Corp.,  based  in  San 
Francisco,  has  been  building  a  huge 
archive  o:  Internet  pages.  From  "he 
summer  or  1990  to  the  present,  .-^Jexa 
has  sucked  in  anv  page  that  has  ap- 
peared on  th,e  'A'eb,  building  a  reposi- 
top/'  ot't)  r.-iiiion  bvtes  of  information. 

The  cache  '.viU  allow  the  com  pa- 


nned 
ut-of- 


-ured 


and  financial  filings,  that  ■•viil  allow 
Users  to  bemer  iudge  the  cuaiicy  of  the 
site,  .\le.xa  '.vill  also  autom.a-colly  gen- 
erate ratings  oi  die  sites  bai<?d  on  data 
collected  rromi  'users. 

Sut  does  the  N'et  need  ar.other  rat- 
'Ir.g  -er-ice^  N'a'.igarion  ser.-.ces.  such  a^ 
".■^.L  -e  TOm  Yahoo;  Inc.  sr.c  L'-;c:te  Lr.c. 


r.^U 


e-rui  sear:.-. 
ir.c  director-'  services.  .-_most  an--' 
".;'.-■>  5er.'ice  of  jn\'  .-ep'-te  provides 
recommendations  or  related  luiiss. 
Pieces  of  sofr.vare  cailec  agents  are 
being  retined  to  let  users  predetermine 
'.'.nat  -aiform.ation  thev  war.:  :o  retrle-.e 
ir.d  -jien  coilec  it  autom.ar.cailv 

'It  seems  like  som.e:.-„ng  from. 
r\\(:  -."ears  ago, '  sa'.^  Peter  xrasilovsk".'. 
senior  ana-'-'s:  at  .\-len 
Communicaticns  Inc.  ■". 
Bethesda.  Mc.  'It  seem.s 
like  thev're  assumang 
Internet  users  need  a  ver-' 
strong  tool  to  r.elp  them, 
find  things." 

The  browser  add-on 
will  be  free  to  the  first 
10,000  users  ■■'•no  request 
it.  Reiiistration  :o  receive 
be:a  copies  began  Julv  1~ 
at  the  com.panv's  site 
___^^       iwww.ale.xa.com.  . 

.\le.xa  Internet  wul  sup- 
port itself  duough  ads  luixed  to  fea- 
rures  of  the  tool  bar.  But  Kaiile  sa'.-s  no 
advertisers  have  vet  been  >igned. 

Kahle  founded  the  com.panv 
along  with  Bill  Dunn.a  for—.eroriir.e 
>erMces  executi'.e  at  D<5vv  'ones  >5<  Co. 
The  com.panv,  started  in  .-.p'll  l^^^o, 
'v'ceived  55  mulion  in  n_nc_ng  trom. 
Ftoile  investm.ents  of  SvMtzeriana. 

Kahle  sold  his  previous  venture. 
\vAIS  Inc.,  to  .\merica  Online  Inc. 
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The  Internet  Boom 
Is  Just  Beginning 

Big  business  is  the  next  frontier, 
attendees  at  Chicago  trade  show  sa\- 
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Alexa  searches  while  you  surf 

By  Jim  Kers'.ette-.  PC  Week  Online 

A  new  San  Francisco  company  is  putting  a  fresh  spin  on  search 
technology:  Search  as  you  go. 

Alexa  Internet  Inc.  last  week  launched  beta  trials  of  a  free  service  that, 
instead  of  giving  users  a  static  search  results  page,  suggests  places  to 
go  as  users  make  their  way  around  the  Web. 

The  service  provides  background  information  about  a  site  a  user  is 
visiting  and  suggests  where  a  user  may  want  to  go  based  on  an 
analysis  of  Web  surfing  patterns,  content  and  linkages. 

Alexa  developers,  some  of  them  former  employees  of  parallel 
processing  machine  maker  Thinking  Machines  Corp.,  have  spent  the 
last  year  essentially  cataloging  the  Web,  drawing  comparisons 
between  sites  and  determining  where  users  are  likely  to  go  while 
moving  around. 

Users  navigate  the  search  results  with  the  Alexa  Toolbar,  which  is 
available  for  download  at  www.alexa.com  for  Web  browsers  that  run 
on  Windows  95  or  Windows  NT. 

The  service  draws  information  for  four  basic  features:  background 
information  on  the  current  site,  Web  surfing  suggestions,  a  cached 
collection  of  dead  Web  sites  stored  on  Alexa's  Solaris  servers  and 
chat  groups. 


m 
# 


■mx)Sf 


The  beta  period,  which  started  last  week,  is  limited  to  the  first  10,000 
users  to  register  and  download  the  client. 

Alexa  is  relying  on  advertising  for  its  revenue  and  should  take  the 
service  to  general  release  early  next  year. 

"This  IS  a  pretty  nifty  idea,"  said  Dan  Chen,  an  IT  manager  at  a  San 
Francisco  company.  "I  hope  they  come  up  with  enough  advertising  to 
keep  it  going." 


O 


TOP  OF  PAGE 


-AZDNetSite- 


JITE  GUIDE  FIND  T 


C,>p\n^ht  199-  Ziff-Diius  liK  Ml  Ri^ht^  Rese^.tJ  R^pmduaum  m  sOu>U  .t  m  pan  m  urn  fo 
■.vntten  permisunn  ,,f  Z:rf-Dtnis  Inc  is  proluhiieJ  PC  Week  ami  PC  '■Veek  Online  and  -lie  reipe 
Ziff-Davis  Inc 


ir  medii 


ithoitl  express 
■ademarks  of 


v3  zclnei  com/ocweek 


r*CWor-Mr-u.n         wr         ^^r\r-yjn*Atc        v^vyivifuiirNU 


JULY  28,  1997  •  Volume  14.  Number  32 


BREAKING  STORIES:  wvrw.pcweek.com 


NEWS 


Alexa  Searches 
While  You  Surf 

BY  Jiy  KERSTETTM 

A  NEW  Sa.N  FR.ANCISCO  COMPa.W  IS  PUTTING 

a  fresh  spin  on  search  technology:  Search 
as  you  ^0. 

Ale.xa  Internet  Inc.  last  '.'.eek  launched 
beta  tnalsofa  tree  semes  that,  instead  of 
giving  users  a  static  search  results  page,  sug- 
gests places  to  go  as  users  make  their  way 
around  the  Web. 

The  service  provides  background  infor- 
mation about  a  site  a  user  is  visitmg  and 
suggests  where  a  user  may  want  to  go  based 
on  an  analysis  of  Web  suning  patterns,  con- 
tent and  linkages. 

Ale.xa  developers,  some  of  them  for- 
mer employees  of  parallel  processing  ma- 
chine maker  Thinking  Machines  Corp.. 
have  spent  the  last  year  essentially  cata- 
loging the  Web.  drawing  comparisons  be- 
tween sites  and  determining  where  users 
are  likely  to  go  while  moving  around. 

Users  navigate  the  search  results  wuh  the 
Alexa  Toolbar,  which  is  available  tor  down- 
load at  www.aJexa.com  tor  Web  browsers 
that  run  on  Windows  95  or  Windows  NT. 

The  service  draws  information  for  four 
basic  features:  background  information  on 
the  current  site.  Web  surfing  suggestions, 
a  cached  collection  of  dead  Web  sites  stored 
on  Alexa's  Solaris  servers  and  chat  groups. 

The  beta  penod.  which  started  last  week, 
is  limited  to  the  first  10,000  users  to  regis- 
ter and  download  the  client. 

.Alexa  IS  relying  on  adv  emsing  for  its  rev- 
enue  and  should  take  the  service  to  gen- 
eral release  early  next  year 

"This  IS  a  pretty  nitty  idea."  said  Dan 
Cr.en,  an  IT  manager  at  a  San  Francisco 
company.  "I  hope  they  come  up  with 
enough  advertising  to  keep  it  gomg."  < 
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EXTRAI  Highlights  from 
Internet  World  Summer 
Show 

By  Richard  Wiggins 


Internet  Outlook  is  a  biweekly  column 
exploring  Internet  trends;  what's  happening  with 
Internet  technologies,  and  how  individuals, 
businesses,  and  government  are  coping  with 
changes  that  occur  in  "Internet  tune." 


Chicago  hosted  most  of  the  major  players  in  Internet  technology  last  week  when 
Mecklermedia  brought  its  Internet  World  trade  show  to  the  Windy  City. 
Mecklermedia  hosts  Internet  World  trade  shows  on  the  East  Coast  of  the  US  in 
Fall  (usually  New  York  City)  and  the  West  Coast  m  Spnng  (usually  Los  Angeles). 
Last  week's  show  was  the  inaugural  summer  Internet  World,  presumably  to  be  held  in 
Chicago  each  summer  for  a  primarily  midwest  audience.  The  show  was  held  in  the 
sparkling  new  North  Building  of  McCormick  Place. 

Microsoft  Alliance  Leads  to  Free  RealAudio  /  RealVideo 
Servers 

During  Internet  World,  Progressive  Networks  announced  that  Microsoft  will  invest  in  the 
company  and  will  license  RealAudio  and  RealVideo  technology  for  use  in  its  Netshow 
product.  Netshow  made  a  splash  this  summer  when  Bill  Gates  gave  the  kevnote  address  at 
the  Special  Libranes  Association  meeting  in  Seattle.  The  alliance  with  Progressive  means 
that  Netshow  will  be  based  on  the  industry  leader's  technology  instead  of  some  new 
Microsoft  concoction. 

Paul  Thelen,  Progressive's  marketing  manager  for  RealVideo,  told  me  that  Microsoft  also 
will  install  RealAudio  and  RealVideo  server  capability  into  its  new  Site  Server  releases. 
At  the  same  time,  Progressive  announced  that  its  streaming  server  will  now  be  available 
for  ft-ee  for  low-volume  applications:  any  company  can  use  one  60-stream  server  license 
without  charge. 

Finally,  Microsoft  will  bundle  RealVideo  player  software  in  all  distnbutions  of  Internet 
Explorer  --  including  when  the  Explorer  is  pre-installed  on  PCs  for  retail  sale.  Thelen  says 
some  users  complain  about  the  overhead  of  having  to  install  Real  players,  "and  this  means 
ubiquitous  player  distribution." 

Progressive  thus  enhances  its  role  as  market  leader  on  both  the  server  side  and  the 
desktop.  On  the  server  side,  any  customer  using  Microsoft  server  software  already  has 
Progressive's  technology  ready  to  use.  Any  low  volume  site,  w  hich  previously  may  have 
relied  on  the  infenor  alternative  of  pseudo-streaming,  now  serves  for  ft-ee.  And  on  the 
desktop  side,  users  can  tap  into  Real  formats  without  downloading  and  configuring  a 
plugin.  The  net  effect  of  Microsoft's  moves  solidifies  Progressive's  already  formidable 
position  as  leader  in  the  streaming  multimedia  arena. 

A  Rival  to  CU-SeeMe? 
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CU-SeeMe  was  vvntten  at  Cornell  University  about  five  years  ago  by  Tim  Dorcey.  Dorcey 
was  working  as  a  statistician  at  Cornell  (he  has  a  master's  in  statistics  and  a  master's  in 
psychology)  and  was  challenged  by  his  boss  to  write  a  program  that  took  advantage  of 
new  Macintosh  video  capability,  o'ffenng  desktop  videoconferencmg  via  the  Internet. 

Dorcey  accepted  the  challenge.  Even  though  CU-SeeMe  was  his  first  senous 
programming  effort,  it  became  wildly  popular.  Eventually  a  Windows  version  was 
offered,  and  Cornell  commercialized  CU-SeeMe  through  White  Pine  Software. 

Now,  Dorcey  comes  forth  with  iVisit,  marketed  by  a  California  firm,  Boxtop.  Dorcey  says 
his  second  generation  Internet  video  conferencing  product  is  far  better  tuned  to  the 
Internet.  In  an  mterview  he  touted  the  fact  that  iVisit  does  not  require  refiectors  for  groups 
to  set  up  a  conference;  he's  developed  a  relay  architecture  that  forwards  audio  and  video 
bitstreams  efficiently  around  a  virtual  circle  of  participants.  This  would  make  it  easy  for 
anyone  to  form  a  group  of  participants  without  having  to  install  reflector  software. 

Dorcey  also  says  he's  made  the  problem  of  finding  groups  of  people  to  communicate  with 
"much  more  like  the  ways  in  which  people  meet  people  in  real  life."  With  CU-SeeMe, 
finding  people  is  accomplished  primarily  through  non-CU-SeeMe  channels,  such  as  Web 
pages,  Usenet  postings,  and  the  Fourl  1  service.  If  Dorcey's  claims  are  true,  a  product  that 
performs  better  than  CU-SeeMe,  that  doesn't  require  reflectors,  and  solves  the  problem  of 
finding  people  and  affinity  groups  will  prove  formidable  competition  for  CU-SeeMe. 

Alexa:  Merging  Web  Searching  and  Group  Experiences 

Also  announced  in  time  for  Internet  World  was  Alexa  Internet,  "the  Web  navigation 
services  that  learns  from  people."  Alexa  is  the  brainchild  of  Brewster  Kahle,  best  known 
for  having  invented  Wide  Area  Information  Servers,  or  WAIS,  when  he  worked  for  a 
supercomputer  vendor.  Thinking  Machines  Corporation. 

Alexa  is  service  that  allows  the  Web  surfing  experiences  of  many  users  to  be  tracked 
centrally.  The  Alexa  search  engine  takes  into  account  how  popular  sites  are  when 
weighting  results  of  searches  to  present  to  users.  In  order  to  use  Alexa,  you  must 
download  the  Alexa  client  program,  which  works  in  conjunction  with  your  Web  browser. 
Kahle  told  me  they  experimented  with  Java  for  this  function,  but  found  they  couldn't  build 
an  applet  that  performed  as  well  as  compiled  client  code. 

The  product  was  announced  this  month  but  client  programs  are  being  distributed  on 
controlled  release  while  the  Alexa  staff  adds  robustness  to  the  service.  Kahle 
demonstrated  Alexa  for  me  showing  how  individuals'  surfing  habits  are  added  to  the 
central  Alexa  database.  Kahle  emphasized  that  personal  information  is  not  gathered  by  the 
Alexa  service:  "We  couldn't  even  honor  a  court  order  to  reveal  what  your  favorite  sites  are 
because  we  don't  gather  that  data.  We  gather  aggregate  data,  not  individual  data." 

Alexa  ties  into  another  Kahle  venture,  the  Internet  .Archive.  This  non-profit  undertaking 
seeks  to  build  an  archive  of  all  of  the  content  on  the  Web  (and  on  Usenet)  on  a  regular 
basis.  Kahle  told  me  his  motivation  is  "capturing  our  digital  hentage  for  future 
generations."  He  says  they  now  have  several  terabytes  of  data  in  the  Archive.  In  general, 
the  Archive  is  not  open  for  mass  exploration;  however,  a  few  scholars  are  already 
examining  its  contents. 

Sometimes  Web  sites  expenence  meltdowns  of  one  sort  or  another.  For  instance,  when  the 
Heaven's  Gate  mass  suicide  became  known,  the  cult's  Web  site  was  so  overloaded  that 
pages  were  inaccessible.  In  other  cases,  pages  or  entire  sites  may  go  offline  permanently. 
.Alexa  will  have  access  to  the  Internet  .Archive.  Kahle  says  the  Alexa  user  will  not  see  the 
dreaded  Error  404  for  a  page  that's  no  longer  accessible;  instead,  he  or  she  will  see  a  copy 
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from  the  Internet  .\rchive. 

The  group  surfing  concept  is  a  fascinating  one.  It's  been  discussed  in  vanous  forms  for  as 
long  as  we've  had  a  Web.  but  to  my  knowledge  this  is  the  first  serious  attempt  at 
implementing  the  concept.  I  fear  that  there  may  be  significant  obstacles  for  Alexa  to 
overcome:  people  don't  want  to  install  yet  another  client,  and  people  will  need  a 
compelling  reason  to  try  yet  another  search  engine.  But  if  Alexa  delivers  on  the  promise 
of  getting  people  the  content  they  really  want  or  need,  its  future  is  bright. 

Digex:  A  Regional  ISP  Steps  Up  to  the  National  Stage 

.Among  Internet  Service  Providers  at  the  show,  Digex  made  the  biggest  splash.  Anxious  to 
shed  their  image  as  a  D.C.  area  ISP,  Digex  offered  an  imposing  booth,  ran  large  ads  in  the 
show  brochure,  and  held  a  press  conference  to  tout  their  new  strength  after  merging  this 
month  with  Intermedia.  Digex  brags  about  99.9%  uptime,  redundancy,  and  the  fact  that 
they  only  use  Cisco  routers.  Obviously  they  feel  that  businesses  will  increasingly  want 
ISPs  that  offer  high  reliability  and  good  throughput,  and  they  want  business  Internet 
connectivity  buyers  to  think  of  Digex  as  leaders. 

I  chatted  with  Clyde  Heitzelman,  president  of  Digex'  Business  Internet  Connectivity 
Group,  about  the  role  of  ISPs  in  the  fliture.  Specifically  I  asked  if  telephone  companies 
won't  eventually  wipe  out  a  lot  of  ISPs,  once  they  figure  out  how  to  offer  good  Internet 
services,  simply  because  they  own  the  phone  lines.  Heitzelman,  whose  worked  many 
years  for  major  telephone  players,  said  that  regulators  will  require  phone  companies  to 
offer  bandwidth  to  their  Internet  business  units  at  the  same  prices  external  ISPs  pay,  and 
that  external  ISPs  like  Digex  will  be  able  to  compete  effectively. 

Are  Internet  Trade  Shows  Changing? 

The  show  seemed  somewhat  smaller  than  previous  Internet  Worlds.  Both  Netscape  and 
Microsoft  were  conspicuous  by  their  absence.  It's  not  un-heard  of  for  Netscape  to  skip  a 
trade  show,  but  usually  Microsoft  is  there  if  it  makes  any  sense  to  be  there.  Overall  the 
number  of  exhibitors  seemed  somewhat  less  than  at  past  events.  Attendance  seemed  lower 
than  one  might  have  expected,  and  I  noticed  coupons  in  the  Chicago  Tnbune  offenng  free 
attendance.  Perhaps  as  a  new  venue  the  Chicago  show  will  need  a  year  or  two  to  ramp  up. 

The  feel  of  this  show  was  different  than  past  Internet  Worlds.  Past  shows  would  appeal 
equally  to  consumers,  to  Internet  newbies,  and  to  businesses.  Here,  electronic  commerce 
seemed  to  be  the  biggest  focus,  with  major  technology  firms  such  IBM,  Sun,  HP,  and 
Oracle  emphasizing  tools  for  doing  business  over  the  Internet.  Companies  in  the  Internet 
business  feel  they  will  make  money  from  other  businesses,  not  from  end  consumers. 

Turnstile  counts  at  trade  shows  are  as  unreliable  as  police  crowd  estimates  at  riots,  but 
Internet  World  claims  to  attract  up  to  50,000  attendees  at  each  of  its  shows.  With  several 
shows  in  the  US  and  around  the  world,  more  people  may  physically  attend  an  Internet 
Wodd  show  than  the  300,000  who  subscribe  to  the  magazine  of  the  same  name.  It  seems 
neither  a  glossy  magazine  nor  all  the  content  on  the  Web  can  compete  with  the  old 
fashioned  face  to  face  convention. 
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may  make  the  task  of  finding  what  we  need  on  the  Web  a  lot  easier 
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A  friend  who  learned  his  navigating  skills  in  the  South  Pacific  during  the  Second  World 
War  once  gave  us  his  opinion  of  Columbus's  skills  as  a  navigator:  Columbus  wasn't  quite 
sure  where  he  was  going,  didn't  know  where  he  was  when  he  got  there,  and  when  he 
returned,  didn't  know  where  he  had  been.  To  our  mind,  this  pretty  well  sums  up  the 
feeling  we  often  get  while  navigating  the  World  Wide  Web  in  search  of  enlightenment  or 
at  least  informahon.  With  the  Web's  content  having  expanded  to  an  estimated  two 
terabytes  of  data,  the  sheer  magnitude  and  disorganization  of  that  information  leads  us  to 
worry  that  no  amount  of  searching  by  conventional  means  will  uncover  everything  we 
might  find  useful. 

We  know  that  we  aren't  alone  in  this  frustration.  A  recent  survey  done  at  Georgia  Tech 
found  that  nearly  a  third  of  Web  users  are  unhappy  with  their  ability  to  find  information 
on  the  Web.  Search  engines  return  Usts  of  hundreds,  even  thousands,  of  sites  to  any  query, 
the  majority  of  them  irrelevant  to  our  needs  or  of  questionable  authority.  There  is  the 
additional  problem,  as  we've  pomted  out  previously,  of  me  so-called  "invisible"  Web  — 
pages  that  do  not  exist  as  separate  files  but  are  generated  on  the  fly  when  a  user  makes  a 
specific  query.  The  Amazon.com  and  Barnes  &:  Noble  book  sites,  for  example,  are  almost 
entirely  built  from  databases  that  create  pages  at  the  request  of  individual  users.  The 
spiders  from  the  major  search  engines  never  find  these  pages,  which  is  why  typing 

(Continued  on  Page  Two) 
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Private  Profiles:    Onyx  Software 

Testing  the  Microsoft  style  on  a  fast-growing  market  opportunity 

It's  become  a  popular  cliche  when  the  subject  of  Microsoft  spinoffs  comes  up  to  refer  to  the 
growing  population  of  Redmond-area  startups  as  "the  Baby  Bills."  It  would  seem  to  be  a 
perfect  mamage  for  the  hard-sell  '90s:  Microsoft's  bottom-line  mentality  and  business 
focus  joined  with  youthful  entrepreneurship.  Certamly  all  those  programmers  and 
managers  aspiring  to  greatness  with  financing  fueled  largely  by  Bill  Gates's  stock-option 
largesse  would  like  to  believe  some  of  his  drive  and  acumen  has  rubbed  off.  Indeed, 
we've  seen  some  promising  young  companies  sprouting  around  Puget  Sound.  Of  those, 

we'd  nominate  Onyx  Software  as  the  most 
eager  to  achieve  something  like  Microsoft's 
spectacular  success  by  following  its  lead  at 
every  turn.  The  question  is  whether  Onyx 
will  carry  its  devoted  support  for  the 
mothership  further  than  is  healthy  for  a 
well-rounded  offspring. 

[Continued  an  r'age  Sixj 
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"Raymond  Chandler"  into  a  search 
engine  will  not  list  either  of  these  sites  in 
the  top  ten. 

Another  problem  is  that,  by-and-Iarge, 
searches  are  text-based.  This  ignores  all  of 
the  multimedia  data  —  video,  still  images, 
animation,  and  sound  —  that  exist  on  the 
Web.  If  you  only  want  to  see  the  trailer 
for  "Conspiracy  Theory,"  you  shouldn't 
have  to  download  a  pile  of  text  reviews 
until  you  find  one  that  has  the  video 
attached.  The  problem  of  finding  digital 
media  will  be  especially  acute  on  corporate 
intranets  as  they  become  more  widely 
used  as  repositories  for  presentations  and 
training  material  using  graphics,  sound, 
and  video. 

They  just  want  to  help 

We  are  heartened,  however,  by  the  num- 
ber of  efforts  that  have  sprung  up  over  the 
last  year  or  so  to  wrestle  with  some  of 
these  issues.  Not  surprisingly,  they  come 
mostly  from  new  companies  that  sense  an 
opportunity  to  create  a  substantial  busi- 
ness out  of  providing  smarter  ways  to 
search  in  the  same  way  that  Netscape 
Communications  and  Yahoo!  prospered  by 
offering  basic  navigational  tools.  We 
aren't  sure  that  they  are  likely  to  find  an 
application  category  as  fundamental  as  the 
browser  or  the  Web  search  engine  proved 
to  be.  But  we're  fairly  sure  that  they  are 
working  on  problems  of  considerable 
importance  to  a  lot  of  people  —  usually  a 
good  starting  point  for  an  entrepreneur. 
We'd  include  on  that  Ust  —  but  not  in 
this  letter  —  the  companies  developing 
push  technologies  of  one  kind  or  another. 
While  the  efforts  of  BackWeb  Technologies, 
Diffusion,  PointCast,  and  others  are  helping 
to  get  us  information  we  already  know  we 
want,  they  don't  do  much  for  the  problem 
of  getting  to  sites  we  aren't  familiar  with  to 
collect  data  that  may  or  may  not  be  of 
interest.  Similarly,  we  aren't  dealing  here 
with  collaborative  filtering  and  other  so- 
called  agent-based  technologies,  the 


bailiwick  of  companies  like  Autonomy, 
Firefly  Network,  and  Net  Perceptions, 

which  build  profiles  over  time  and  then 
automahcaUy  deliver  information  they 
think  is  relevant  to  our  needs.  (For  our 
discussions  of  those  subjects,  see  "Push 
Comes  to  Shove,"  November  4,  1996,  and 
"In  From  the  Cold,"  September  23,  1996.) 
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jfewster  Kahle's 
latest  brainchild 
assists  seafchefs  by 
keeping  track  of 
previous  visitors. 


Neither  push  nor  agent  technology 
deals  with  a  hindaxnental  fact  of  human 
nature:  Goals  and  interests  often  change 
from  day  to  day.  Push  technology  is  fine 
for  those  constants  people  may  always 
want  —  entertairiment,  news,  weather,  and 
sports  at  home,  or  financial  updates  in  the 
office.   But  it  is  less  helpful  if  you  need  to 
know  in  the  next  half-hour  what  the  top 
selling  drugs  were  last  year  because  you 
lack  some  crucial  facts  for  a  report.  Agents 
have  the  same  drawbacks.  Ideally,  what  a 
lot  of  people  want  is  to  type  in  a  query  — 
without  a  vast  knowledge  of  Boolean 
search  techniques  —  and  quickly  end  up  at 
the  best  source  of  information,  minus  the 
false  starts  and  dead  ends. 

Is  free  cheap  enough? 

That's  the  goal  being  pursued  by  several 
companies  that  have  surfaced  recently.  In 
some  cases,  they  hope  to  achieve  it  by 
putting  special  search  capabilities  on  the 
client  side  rather  than  on  the  server,  as  is 
typical  with  conventional  search  engines. 
All  have  a  revenue  model  reminiscent  of 
that  pursued  so  effectively  by  Netscape 
and  other  Web  pioneers:  Give  away  Lhe 
software  to  users  ar.d  make  money  by 
selling  advertising  on  the  server  or  by 
charging  Web  sites  for  Lhe  right  to  use 
the  technology. 

But  the  companies  are  using  their 
improved  search  techniques  for  very 
different  purposes.  Alexa  Internet  tracks 
the  movements  of  earlier  visitors  to  a  site, 
providing  a  kind  of  guide  to  trails  blazed 
in  the  wilderness;  Citizen  1  Software  and 
Netbot  offer  access  to  otherwise-invisible 
data  on  commerce  sites;  NetMetrics  pro- 
vides short  previews  of  Web  pages  to 
narrow  the  search  process;  Netword  is 
marketing  a  search  engine  with  a  free 
client  component  that  searches  on  key- 
words registered  by  orgaruzations  and 
individuals  and  stored  in  a  central  server. 
An  exception  to  the  client/ server  approach 
is  Magnifi,  which  addresses  the  problem  of 
searching  image,  sound,  and  video  files  by 
giving  Web  servers  the  power  to  summa- 
rize and  index  them. 

Breaking  the  trail 

Of  these  newcomers,  we  think  Alexa  may 
hold  the  most  promise  for  the  broad  search 
market.  The  brainchild  of  information 


scientist  Brewster  Kahle,  tadier  uf  Lhe 
Wide  Area  Information  Server  (WAIS),  a 
forerunner  of  today's  search  engines, 
Alexa  starts  with  the  assumption  that  the 
best  way  to  find  the  most  relevant  sources 
of  information  is  to  learn  from  the  experi- 
ence o{  others.  Alexa's  client  software 
provides  users  with  a  toolbar  that  accom- 
panies them  from  site  to  site.  Based  on 
information  that  has  been  gathered  about 
the  decisions  made  by  previous  visitors  to 
a  site,  the  toolbar  offers  suggestions  about 
where  to  go  next. 

Say  you  pick  a  site  to  leam  about 
kayaking.  Alexa  tells  you  how  well  other 
kayakers  before  you  who  were  using 
Alexa  liked  the  site  (did  they  keep  on 
clicking  or  exit  immediately?)  and  what 
path  they  followed  through  the  site.  It  also 
teUs  you  who  publishes  the  site  so  you  can 
decide  on  the  authenticity  of  the  informa- 
tion. Sites  are  rated  according  to  how 
often  they  are  updated,  the  speed  of  the 
server,  and  popularity  with  users.  To  find 
other  useful  sites  relating  to  kayaking  does 
not  involve  returning  to  the  search  engine 
or  foUowing  embedded  hot  links.  Instead, 
Alexa  continues  to  offer  up  a  Ust  of  sites 
that  other  users  have  frequented,  and 
users  are  encouraged  to  vote  on  sites  and 
to  recommend  links.  Alexa's  pitch  to 
advertisers  is  that  advertising  within  the 
toolbar,  which  is  always  displayed  on  the 
computer  screen,  provides  longer  expo- 
sure than  ads  on  individual  sites. 

Coming  attractions 

In  another  approach  to  helping  the  search 
process,  NetMetrics  provides  a 
downloadable  piece  of  software  that 
works  within  the  browser  to  provide  short 
previews  of  each  Web  page  as  it  is  down- 
loaded. The  product  automatically  pulls 
out  headlines  or  key  text  to  provide  a 
hierarchical  outline  of  all  the  pages  in  the 
site  and  a  preview  of  each  section.  In 
theory,  this  enables  the  user  to  decide 
quickly  whether  to  move  on  to  a  more 
useful  site  or  to  delve  deeper.  WebTurbo 
can  query  six  of  the  most  popular  search 
engines  at  once  and  remove  ar.y  duplica- 
tion. Search  results  can  be  organized 
along  with  the  previews  and  saved  m  topic 
folders.  We  found  the  previews  presented 
by  WebTurbo  were  somehmes -T' •'   '  ; 
and  of  uneven  quality  but  v-/et>:,  a\  general. 
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In  Our  Opinion 

After  the  search,  the  rescue 


If  there's  any  lesson  all  the  new  startups  can 
draw  from  the  bloody  jockeying  for  pole  posi- 
tion engaged  in  by  Excite,  Yahoo!,  and  the  rest  of 
the  search  crew  in  the  past  year  and  a  half,  it's 
that  the  business  of  finding  things  on  the  Web  is 
not  just  about  search.  Nor  is  it  about  the  technol- 
ogy used  to  index,  sort,  classify,  and  find  things 
online.  Instead,  all  of  that  technology  was 
merely  a  beachhead  from  which  to  launch  a 
variety  of  content  programming  sites,  editorial 
products,  and  one-off  Web  services  that  add 
value  to  what  quickly  became  a  commodity. 

Inevitably,  the  intriguing  client/server 
technology,  agent  intelligence,  and  database 
savvy  of  Alexa  Internet,  NetBot,  and  the  others 
wlU  ultimately  provide  no  more  than  a  foot  in 
the  door,  allowing  these  companies  to  develop 
more  sophisticated  content  and  service  busi- 
nesses. The  merger  of  client-side  software  and 
back-end  databases  will  enable  them  to  refine 


those  services  gracefully  over  time.  That's 
important,  because  in  the  not-too-distant  future, 
the  basic  sorting  and  searching  capabilities  of 
these  "vertically  oriented"  services  wiU  be 
devalued  as  the  Internet  evolves  into  a  more 
sophisticated  distnbuted  network  of  directory 
servers,  mobile  agents,  and  robust  metadata 
repositories. 

.As  the  software  infrastructure  of  the  Net  gets 
smarter,  online  power  plays  such  as  Netword's 
gambit  to  own  the  very  words  we  use,  and 
thereby  intrude  into  our  daily  browsing,  will 
fade  away.  Such  schemes  cannot  possibly  scale 
to  encompass  an  Internet  growing  by  leaps  and 
bounds.  Attempts  to  control  the  way  we  surf 
will  yield  to  the  need  to  assess  —  and  reassess 
—  the  fundamentals  of  seUing  software,  packag- 
ing content,  and  lunng  advertisers.  That's  a 
tough  race  to  run,  but  at  least  these  companies 
are  at  the  starting  line.      Z 
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Netwofd  hopes 
youll  go  thfough 
its  Web  site  to  get 
to  everything  on 
the  Internet. 


helpful  Ln  weeding  out  the  obvious  clin- 
kers. We're  told  that  the  NetMetrics  — 
which  launched  WebTurbo  in  July  — 
expects  to  have  25,000  users  by  next  month, 
at  which  time  it  will  begm  actively  solicit- 
ing advertisers. 

Taking  a  cue  from  America  Online's 
"Keyword"  searches,  Netword  enables  site 
owners  to  create  their  own  "Networds"  for 
specific  content  locations  such  as  an  annual 
report  or  company  backgrounder.  Users  can 
then  just  type  in  these  words  rather  than  a 
lengthy  URL  to  reach  their  destination  by 
way  of  the  Netword  server.  Netword 
recently  signed  up  Progressive  Networks,  that 
pays  a  monthly  fee  which  depends  on  how 
many  Networds  it  registers.  Individuals  who 
want  their  own  Networds  for  their  home 
pages  pay  SI  a  month. 

All  wrapped  up 

Netbot,  using  research  into  intelligent 
agents  at  the  University  of  Washington,  has 
come  up  with  a  client/server  search  assis- 
tant that  we  wouldn't  really  classify  as  an 


agent.  Netbot's  software  called  Jango,  a 
shopping  application  that  is  the  first  in  a 
series  of  application-specific  helpers, 
allows  the  user  to  search  multiple  sites 
simultaneously  for  the  product  informa- 
tion stored  in  databases  that  other  search 
engmes  can't  penetrate.  Netbot  does  this 
by  endowing  the  client  with  natural- 
language  technology  that  can  take  a  query 
and  figure  out  the  exact  terminology  for 
what  IS  bemg  searched.  Then  the  Netbot 
server  routes  the  query  to  the  appropriate 
sites,  talking  to  them  through  an  informa- 
tion adapter,  also  known  as  a  wrapper, 
that  must  be  written  for  each  site  to  be 
searched.  Future  wrappers  might  be 
developed  for  job  search,  news  retrieved,  or 
white  pages  applications. 

A^  last,  some  service 

jango's  task  is  to  help  online  shoppers 
locate  and  purchase  goods  from  online 
merchants  by  searching  the  appropriate 
-'.-  "  and  then  returning  with  product 
reviews,  specifications,  pricing,  and  other 
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Jongo's  appeal 
may  be  limited  by 
its  scant  selection 
of  stores. 


shopping  details.  Users  can  buy  only  from 
a  few  categories  of  goods  at  present, 
including  books,  wine,  flowers,  and 
computer  hardware  and  software,  because 
Netbot  is  focusing  on  only  the  top  five  or 
ten  retailers  in  each  category  as  well  as 
information-based  sites  such  as 
newsgroups  for  reviews.  The  company 
doesn't  charge  either  users  or  commerce 
sites  for  the  privilege  of  using  Jango, 
hoping  to  live  on  advertising  revenues. 
For  newcomers  to  online  shopping, 
Jango  seems  useful  in  helping  to  find  the 
places  where  books  or  flowers  are  sold  and 
providing  some  companson  pricing.  But 
veteran  shoppers  probably  already  have 
their  favorite  online  stores  and  don't 
spend  a  lot  of  time  comparative  shopping. 
We  also  note  several  complaints,  in  that 
section  of  Jango's  site  devoted  to  user 
feedback,  about  the  software  missing 
obvious  sites  and  the  limited  number  of 
sites  Jango  brought  up.  Still,  Netbot's 
notion  of  creating  a  more  powerful  vertical 
search  capability  is  certainly  one  way  to 
chop  the  Web  into  manageable  chunks. 

Getting  specific 

Like  Netbot,  Citizen  1  Software  offers 
access  to  invisible  Internet  databases 
through  a  piece  of  free  downloadable 
software  that  sets  off  parallel  searches. 
But  rather  than  just  get  a  report,  such  as 
Netbot  provides,  users  can  view  one 


retrieved  document  while  others  from 
different  providers  load  in  the  back- 
ground. Along  with  its  general-purpose 
version.  Citizen  1  also  markets  industry- 
specific  versions,  initially  for  profession- 
als in  the  pharmaceutical,  device  and 
diagnostic  industries,  called  Citizen  1 
HIT.  Searching  on  diabetes,  for  example, 
will  turn  up  information  about  every- 
thing from  clinical  trial  results  to  ongoing 
research  —  ail  from  within  a  single 
application.  Citizen  1  makes  searching 
more  convenient  by  organizing  databases 
into  categones.  Users  choose  which  to 
search  based  on  bnef  descriptions,  and 
the  chosen  databases  are  search.ed 
simultaneously.  Such  refined  capability 
doesn't  come  cheap,  however.  Citizen  1 
HIT  is  priced  at  51,500  per  seat. 

Picture  that 

Even  without  a  cUent  component,  the 
new  technology  can  help  Web  servers  at 
big  sites  do  a  better  job  of  responding  to 
queries.  Netbot,  for  example,  is  prepar- 
ing a  server  version  of  Jango.  And  that  is 
the  primary  strategy  for  Magnifi,  which 
was  cofounded  by  Eric  Hoffert,  one  of  the 
original  developers  of  Apple  Computer's 
QuickTime  multimedia  technology. 
Rather  than  forcing  users  to  do  a  text 
search  and  download  a  page  to  find  out 
whether  there  is  any  sound  or  video,  the 
Magnifi  server  installed  on  a  site  provides 


Who's  Where 


Private  companies  developing  alternative  Internet  search  technologies 

Company Phone  Headquarters  URL  Products 


Alexa  Internet 

415-561-6900 

Son  Francisco,  CA 

wv/w. alexa. com 

Navigational  aids  based  on 
usage  patterns 

Citizen  1  Software 

415-882-9404 

San  Francisco,  CA 

www. citizen  1  .com 

Industry-specific  search 
software 

Magnifi 

408-863-7200 

Cupertino,  CA 

www.magnifi.com 

Server  ttiat  allows  indexing 
of  multimedia  for  searches 

Netbot 

206-522-7800 

Seattle,  WA 

www. jango.com 

Web  shopping  assistant 

NetMefrics 

,    415-248-2333 

San  Francisco,  CA 

www.webturbo.com 

Software  for  browsing 
at  the  preview  level 

Netword 

703-528-9600 

Arlington,  VA 

www.netword.com 

Keyword  system  for  internet 
searches 
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a  preview  of  an  image,  video,  or  audio  clip. 
From  the  perspective  of  a  site  owner  with  a 
lot  of  multimedia  in  the  database,  Magnifi 
provides  a  way  to  organize  this  content  so 
that  it  IS  easily  and  quickly  searchable  and 
therefore  likely  to  pull  more  traffic.  CNN 
Interactive,  Hollywood  Online,  and  PBS 
Online  have  inst^llled  Magnifi's  server  and 
report  increased  traffic  as  people  discover 
that  they  can  do  better  searches.  While  this 
should  lead  to  higher  advertismg  rates, 
Magnifi  plans  to  make  its  money  selling 
server  software  at  520,000  a  pop  rather 
than  to  seek  a  share  of  the  ad  billings. 

Call  to  arms 

The  big  search  sites  can't  afford  to  sit  stiU 
when  it  comes  to  solving  some  of  the 
problems  addressed  by  these  startups. 
The  client-side  companies,  in  particular, 
pose  to  be  a  threat  because  they  can  take 
users  directly  to  a  site  without  having  to 
check  in  with  a  search  site.  Then  there  is 


the  increased  competition  from  well- 
positioned  newcomers  taking  direct 
aim  at  their  core  business.  CNET,  for 
one,  plans  to  debut  its  Snap  Online 
service  before  year-end,  featuring  a 
new  search  engine  that  will  not  only 
let  users  search  globally  but  enable 
them  access  to  the  databases  oi  some 
75,000  sites  selected  by  a  Snap  team  and 
updated  every  twelve  hours.  Microsoft 
is  also  expected  to  introduce  a  search 
service,  currently  code-named  Yukon. 

Considering  the  survival-of-the- 
fittest  urges  driving  all  the  large  search 
companies,  we  think  it  will  be  difficult 
indeed  for  any  of  the  newcomers  to 
hold  more  than  a  temporary  techno- 
logical lead.  But  the  need  for  quick 
and  efficient  access  to  the  Web's  vast 
resources  is  so  compelling  that  at  least 
some  of  them  wiU,  one  way  or  an- 
other, become  an  integral  part  of  the 
search  process.      Z 
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Onyx  is  akeody 
oheod  of  Micfosoft's 
eofly  growth  pace. 


(Continued  from  Page  One) 

In  terms  of  revenue  growth.  Onyx  is 
already  well  ahead  of  its  role  model.  It 
took  Microsoft  five  years  to  get  to 
58  million  in  revenue  after  Paul  Allen  and 
Mr.  Gates  launched  the  company  in  1975. 
Onyx,  founded  in  1994  by  three  Microsoft 
alumni,  was  profitable  from  the  first  year 
and  had  sales  last  year  of  59.7  million,  a 
gain  of  350%  over  1995.  This  year  Presi- 
dent and  CEO  Brent  Frei  and  Executive 
Vice  President  Brian  Janssen  are  confi- 
dently predicting  a  further  doubling  of 
revenues.  With  customers  now  number- 
ing 140  —  60%  of  them  in  the  software 
industry  —  Onyx  clearly  has  staked  out  a 
prominent  position  in  the  highly  competi- 
tive market  for  customer  management  and 
information  software. 

Big  Green  attack 

That  niche  was  a  natural  for  Mr.  Frei  and 
Mr.  Janssen,  Dartmouth  football  team- 
mates in  the  late  1980s,  who  had  devel- 
oped a  customer  management  database 
for  Microsoft's  international  subsidiaries. 
When  Mr.  Janssen  was  asked  by  Spry, 


CompuServe's  Internet  software  division, 
to  improve  its  databases  for  marketing, 
sales,  and  customer  service,  he  turned  first 
to  Mr.  Frei  and  then  to  Todd  Stevens, 
another  young  Microsoft  colleague.  They 
started  Onyx  in  the  basement  of  a  subur- 
ban home  east  of  Seattle  with  a  5150,000 
personal  investment  and  set  out  to  de- 
velop the  best  possible  customer  support 
product  based  on  Windows  NT  and,  later, 
on  Microsoft  SQL  Server. 

Onyx  entered  a  field  that  had  been 
pioneered  with  Unix-based  client/ server 
products  by  companies  such  as  Aurum 
Software,  Clarify,  Scopus  Technology, 
Siebel  Systems,  and  Vantrve.  The  market 
was  already  growing  briskly;  Gartner 
Group's  senior  analyst  Wendy  Close  says 
the  overall  use  of  technology  for  selling- 
related  functions  is  rising  by  more  than 
50%  axmually,  although  the  category  is 
difficult  to  define  since  it  involves  field 
sales  and  service,  telemarketing,  external 
help  desks,  and  support  for  third-party 
r./  sellers.  Analysts  call  it  technology- 
enabled  selling,  customer  interaction 
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management,  support  services,  and 
various  other  labels.  Ms.  Close  estimates 
that  salesforce  automatiori  alor\e  generated 
S2.7  billion  in  sales  worldwide  last  year. 
Counting  customer  service  and  support, 
the  total  for  these  overlapping  product 
Unes  reached  a  whopping  S3. 4  billion. 

Finders  and  keepers 

The  engine  driving  this  growth  is  the  high 
cost  of  winning  and  retammg  new  custom- 
ers m  the  super-competitive  sales  environ- 
ment of  the  1990s.  The  Harvard  Business 
Rez'ieiu  has  estimated  that  U.S.  companies 
are  now  suffering  an  annual  customer 
turnover  of  50%.  The  big  challenge  for 
management  is  to  reverse  this  tide  of 
customer  defections  and  at  the  same  time 
build  new  customer  relationships  by 
increasing  sales  and  support  effectiveness. 
Onyx's  Mr.  Janssen  points  out  that  it  is 
ten  times  more  expensive  to  acquire  a  new 
customer  than  to  sell  to  an  existing  client. 
Thus,  he  argues,  it's  important  that  every 
employee  dealing  with  customers  be  able 
to  get  mformation  quickly  —  something 
that  might  be  hard  to  do  when  customer 
data  is  scattered  across  many  applications 
and  databases.  One  of  Onyx's  first  cus- 
tomers was  Visio,  a  producer  of  business 
graphics  software,  which  consolidated 
several  applications  into  the  Onyx  Cus- 
tomer Center  package  to  provide  a  single 
point  of  contact  for  55  Visio  employees 
authorized  to  use  and  update  customer 
information.  For  this  convenience.  Onyx's 


customers  pay  about  52,500  per  user, 
compared  to  as  much  as  $5,000  for  compet- 
ing applications,  wfiich  typically  are  sold 
on  the  basis  of  separate  pricing  for  indi- 
vidual modules.  On  the  other  hand. 
Onyx's  ail-in-one  approach  can  be  more 
expensive  if  the  customer  needs  only  a 
couple  of  functions. 

Follow  the  leader 

While  the  product  niche  seems  promismg, 
it  may  be  Onyx's  relationship  with 
Microsoft  that  is  most  vital  to  the 
company's  success.  For  one  thing,  Mr.  Frei 
and  Mr.  Janssen  have  followed  the  iron- 
clad alumni  rule:  Don't  compete  with  the 
alma  mater,  but  do  develop  a  market  that 
can  ride  on  Microsoft  technology.  Onyx 
went  into  customer  management  software 
in  part  because  Microsoft  doesn't  make  it, 
and  the  company  has  developed  exclu- 
sively for  Microsoft  BackOffice.  Onyx 
software  runs  only  on  Windows  NT  and 
Microsoft  SQL  Server,  a  self-imposed 
limitation  that  has  the  potential  advan- 
tage of  reducing  time  to  market  for 
upgrades  and  new  products  by  eliminat- 
ing the  need  to  create  multiple  versions. 

Mr.  Janssen  likes  to  brag  that  the 
company  has  avoided  an  R&D  expense 
"matrix  of  pain"  by  not  supporting 
multiple  systems,  and  we  agree  that  a 
single-platform  focus  should  lead  to  lower 
expenses.  But  we  think  there  are  serious 
risks  as  well.  Some  analysts  view  the 
exclusive  commitment  to  Windows  NT  as 
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At  a  Glance:    Onyx  Software 


Address: 

330  1  20th  Avenue  NE 

Bellevue,  WA   98005 

Telephone: 

425-451-8060 

URL 

wwv/. onyx. com 

President  &  CEO: 

Brent  Frei 

Business: 

Customer  management  software 

Financing: 

$1  1  million 

Investors: 

Foundation  Capital, 

Technology  Crossover  Ventures 

Employees: 

175 

Revenue: 

$9,7  million  in  1996 

As  the  boom  in 
customef  sen/ice 
applicQtons  ploys 
out,  the  industry  is 
likely  to  see  some 
consolidation. 


too  limiting,  Ln  part  because  the  scalability 
of  NT  is  not  yet  proven  for  large  organiza- 
tions.  If  Onyx  wants  to  graduate  from 
seUing  primarily  to  medium-sized  compa- 
nies, says  Gartner's  Ms.  Close,  it  will  be 
forced  to  support  Oracle  database  servers 
and  Unix  hardware. 

Comes  the  Revolution 

Other  analysts  also  fault  Onyx  for  produc- 
ing software  that  has  only  limited  tools  to 
customize  and  configure  the  application 
for  individual  needs  and  for  continuing  to 
operate  with  a  16-bit  architecture  when 
much  of  the  world  has  shifted  to  32-bit 
technology.  The  company  is  also  faulted 
for  being  too  focused  on  the  needs  of 
customer  service  and  support  departments 
and  for  not  being  sufficiendy  attuned  to 
field  sales  organizatior^s  that  require 
software  for  setting  sales  goals  and  fore- 
casting. Onyx  says  it  does  have  such 
software,  called  Revolution,  under  devel- 
opment and  promises  that  when  it  is 
shipped  early  next  year  the  product  will  be 
fully  ported  to  a  32-bit  environment. 

Where  Onyx  seems  most  like  its  giant 
neighbor  is  in  an  aggressive  and  innova- 
tive approach  to  sales  and  marketing.  As 
one  example,  the  company  junked  the 
usual  mix  of  direct  mail,  advertising,  and 
trade  shows  when  it  found  that  its  World 
Wide  Web  presence  was  the  most  efficient 
and  successful  marketing  channel.  Now 
the  Web  provides  over  30%  of  sales  leads 
and  has  helped  give  Onyx  a  reputation  for 
responsiveness. 


In  the  meantime,  Onyx's  competitors 
are  also  tearing  ahead.  .Although  this  is  an 
industry  without  a  standard-bearer,  there 
are  leaders  in  specific  sectors.  Siebel,  for 
example,  is  emerging  in  the  vanguard  of 
sales  opportunity  management  systems. 
Dun  &  Bradstreef  s  Sales  Technologies 
subsidiary  in  applications  for  a  healthcare 
products  sales  force,  and  Austin-based 
Trilogy  Development  Group  m  product 
configuration  systems. 

Our  venhxre  friends  believe  that  the 
current  market  boom  will  continue  for 
about  the  next  five  years,  with  consolida- 
tion slowly  taking  place  as  the  front-office 
sales  and  service  applications  are  inte- 
grated with  back-office  manufacturing  and 
financial  software.  This  process  has  already 
begun  with  the  recent  acquisition  of  Aurum 
by  Boon,  the  Netherlands-  and  Caiifomia- 
based  business  software  company.  We 
imagine  that  Oracle,  PeopleSoft,  and  SAP 
might  be  thinking  along  the  same  lines. 

Please  be  more  specific 

In  such  a  market,  newcomers  like  Onyx 
will  have  to  find  a  way  to  stand  out. 
Despite  the  many  claims  made  by  all  the 
entrants  in  this  developing  market  space, 
there  remain  doubts  that  the  vendors  are 
offering  much  more  than  a  general  solu- 
tion to  the  highly  individual  problems 
companies  face  in  maintaining  and 
strengthening  relationships  with  their 
customers.  Paul  Koontz,  a  partner  at 
Foundation  Capital,  an  Onyx  investor, 
notes  that  the  market  is  so  big  that  the 
software  companies  have  not  yet  been 
forced  to  develop  any  true  vertical  align- 
ment or  to  give  their  products  the  flexibil- 
ity needed  to  adapt  to  differing  needs. 

What  clients  really  want,  say  analysts, 
and  what  will  take  much  more  develop- 
ment, is  a  product  that  is  rich  in  functions 
and  with  tools  that  are  readily  adaptable, 
so  that  no  sales  force  is  locked  into  just  one 
way  of  doing  things.  Onyx  might  well  be 
among  the  first  to  achieve  this.  The 
company,  we  are  told,  is  working  on  ways 
to  make  its  product  less  monolithic  and 
more  flexible,  a  direction  it  must  ulti- 
mately take  if  it  IS  to  attract  larger  organi- 
zations. Onyx  knows,  like  the  godfather  it 
emulates,  that  it  can't  afford  to  have  any 
ideas  cast  in  stone.      Z 
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Why  should  your  Web  site  be  D/6fX  driven? 
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Alexa  Highlights  Web  Site  "Trustmarks" 

[August  25]  Web  navigacon  device  .^lexa  Intemei  wilJ  now  display  a  privacy  'seal  of  approval"  as 
designated  by  l"KU$Te  for  those  Web  sites  promoting  secure  online  pnvacy. 

TRUSTc  LS  a  nonpro6t  organizahon  that  assigns  Truscmarki, '  or  logos,  that  when  cbcked  on  provide 
users  With  infonnation  on  what  kind  of  information  a  Web  site  collects,  what  it  docs  wiih  Chat 
information,  and  with  whom  that  mfonnation  is  shared 

Alexa  Internet  is  a  search  tool  that  mchxles  detaUcd  information  on  Web  sites  being  visited,  including 
who  owns  the  site,  where  the  company  is  physically  located,  how  many  pages  it  contains,  and  how 
"fresh"  the  site  is.  or  how  often  a  site  updates  its  content. 

Launched  in  mid-Julv.  Alexa  Internet  is  a  toolbar  thar  resides  below  a  user's  Web  browser  window 
Users  will  be  able  to  find  out  whether  a  Web  site  displays  a  Trustmark  upon  opening  the  "Where  Arc 
You  Now"  window  in  the  toolbar 

"With  TRUSTe,  our  user?  now  receive  additional  infonnation  regarding  the  way  a  site  handles 
personal  data.  For  example,  an  individual  interested  m  purchasing  clothes  on  the  Internet  could 
quickly  view  which  sites  adhere  to  TRUSTc's  pnvacy  standards  before  they  spend  the  time  to  go  into 
tlie  site  and  judge  whether  they  should  do  businesi  with  this  site,"  said  Brewster  Kahle,  president  and 
co-founder  of  Alexa  Intemet 

The  .AJexa  toolbar  is  compatible  with  Windows  95  and  Windows  NT  operating  systems,  and  can  be 
downloaded  for  free  from  Alexa's  Web  site. 
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Alexa's  Theory  of  Relativily 

Revte^v:  Filtering,  analvticai  algorithjms  link  to  Web  sites — relevant  or  not 


ST  j1MBlMU.f^«tTJM 

■  «r.,..,~<.  n,TA  ,-.-<-.  v«  .»ni,  x  .  ^^. 
H  ^rancTsco  startup  hciD"'*'ebu.v;nl1'>ii 
#4uv:iiil  inr<;rm3i;on  ^v  npplying  cri|- 
labf^ntiie  rmenng  lechrHiiogy  lo  the  in- 
ure Wob 

Aleta.  frotp  Aleia  (nicmtt  Inc..  pro- 
vide; 3  series  of  links  lo  *eb  utej  ihai  ar« 
n:ljii.d  .0  ihc  on;  ihe  ireer  ts  viewing  ?C 
VVctk.  Lab^  found  u  vers-  ;traiehrfor^»rd 
lO  use.  leuing  us  view  bij'C  infofmanon 
iuch  as  a  Weh  'iiie's  owr.t:.  iis  populirilv 
ibi!;iionnumbero(vnruon)andihelTesh- 
.less  of  Its  information  i  ha^ecl  on  ntrv  of- 
■en  It  ;s  updaied).  VVe  ulso  could  vote  on 
*hc!iiirornot  D  iue^esiiUsiie  *asuse'jl 

in  ■^inc  c.isc->.  'he  Ater^  "ei,i  led  u.-".  lO  : 
cr-  'ds^cfjl  related  sure  Jm^t  ^idnt  icncv  ■ 


u*xa  jrsndM  mrermcdm  abMt  t»k  vba  u4  ;«ki  ta  r^att*  litM. 


•ti?i;d  inoiKercjv.1.  ho^oer.  ;ites  iiiai 
A  let  J  considered  related  ■*«  re  onlv  vague- 
ly r;l,ited  or  were  trrelevani  to  US-  The  ba- 
sic jilc  informiiion  that  Alexa  providet 
also  netd<  seme  ^vorlc 

The  prDtram,  vhich  i^  <el  to  ship  next 
.Tioiuh.  ^mII  be  free.  The  conipinv  Soptt  '  fy 
'd  n.-i.ince  thi  program  bv  selling  3dvi;r- 
■I'ernenc;  iliai  will  appear  in  tlie  tool  bar 
Alua  w(.'rlis»nh  Microvjf>  Coq-  md  Net- 
-c.Tp«Ccmniun[C3liriri5Cirr>  Wthbrow^- 
~r.  runnmi  -inder  W,nd<j.vi  15  or  Win- 


other  vendor?  lohclpcommcrnnl'itescre- 
Jiw  p^r^oii.ili/cd'nicrn.-i^MLi'viih  vUiturs. 
A!<;.-.a  s  antonz  tbe  Srsi  to  »pp'v  I'ns  leeh- 
nolorr  1(1  Ihe  enure  Web  Ciirrerily.  b<- 
cDu-w  ihe  beta  t$  tie*  and  iimuctj  in  scope. 
AlCTa  creflies  ib«  list  of  related  links  for  a 
stt;  b\  rnonitonnj  where  u«r5  wertt  after 
viewing  that  stte. 

The  beta  refease  lei  us  vole  f>T»  i  sug- 
gested S(te  ;  relevance,  but  it  *9sn't  clear 
if  AJeta  included  our  vote  m  «  dewrmi- 
nalion  of  related  sites  In  ;he  future,  the 
program  will  determine  Linics  9»*ed  partly 
on  iiS^n'  votts.  company  oiTxuU  iaid. 

Creeling  links  baifid  on  users'  Web 
crawling  .ieldetj  hu-or-mixs  rejtjtsin  leais. 
Whtn  *e  '.iewed  the  PC  w-cic  jue  relat- 
ed 'insj  included  some  wotih*hiie  sites. 
but  5bo  so«-i<  site!  aht)je 
relevance  a»s  question- 
able. tosjvnele.ijiMhe. 
National  Computer  Tec- 
tonics Online  $ite  also 
was  listed,  for  t;tample) 
Other  lesti  provided 
lini^s  "ith  more  rele- 
vance to  If?  lest  sue. 
however. 

According  lo  compa- 
ny orTicials  -hese  irrej. 
uhrities  are  caused  by 
the  rcloliveiv  Jmjll  num- 
ber of  lisers  .n  the  .-Mexa 
bei.i  program  The  offi- 
cials e;«  pea  -esulU  to  he 
more  accurate  after  the 
program  ships  jnd  more  ujers  browsing 
pjttems  nre  analyzed. 

Users  can  add  any  site  to  the  rcUled  linkj 
for  3  W?h  page.  Alexa  monitors  users  «»- 
tnbuinjns  and  id<ls  these  links  into  the 
equation  when  delermming  related  sites 


,  speed  and  freshness  proved  lo  be  highly 
:  sudiCclivc  jrtd  not  icry  useful  in  icSM.  For 
eiample.  server  speed  cj  currendy  based 
on  ho*  long  It  takes  to  relnave  pages 
from  i  me  to  Alesas  offices  in  San  Frar»- 
cijco.  Alihou^  this  miijht  be  useful  to  AJora 
eraployres.  It  mtir»s  jimow  nothing  lo  users, 
espedaify  those  in  other  parts  of  the  coun- 
I  try  or  m  other  countries. 

This  nie>sure  of  >p«ed  and  freshnessabo 
caused  Aiexa's  own  site  lo  apptif  much 
mors  powerful  th»n  such  huge  sites  as  Net- 
scape Communications  Corp.'i  or  Yahoo 

j  inc  ■«  locuies— Ai«u  Kdrcd  J  Fail  ntJn^ 

I  compared  with  these  tiles  Average  ratings 

I  in  OUT  teaii. 

I  A  i>le  5  freshrtiLts  ij  Jftirrnined  hy  the 
average  age  of  pages  on  the  *ite.  with  fresh 
sues  having  an  average  age  of  less  than  sw 
months  stiic  sites  ioav.ii-jge  of  more  than 
rwo  years,  and  aJ  the  risi  rated  as  r-erage 
in  age 

TTiis  led  to  very  misleading  scores  in 
our  tesw — Sites  that  are  updated  several 
limes  per  day  got  average  Kores  because 
tliey  have  very  good  archives,  whereas  new 
'iies  that  badn  t  been  updated  in  a  few 
months  scored  as  frrsh.  We  think  it  would 
be  more  usifui  to  umply  me  sues  as  fresh 
(less  th.m  6  months  old)  or  3iale. 


Aim  1.0  Beta 


«4  a  »W  akt  ID  a  u«ar's  Wae  croosw-.  AJna 
shows  some  cromisa  for  h«ipr<  tB»^  flno 
jlifis  that  othar  us«rs  found  IS  be  usaki  or 
intereiOni.  i-io»wv«r,  th«  b*a  coutd  use 
some  'tTcrov«me«j  to  (s  inaHka  *id  m 
how  IT  dettrmnas  stta  mfannaftofl  txtvz 
rsreaoyrcwnew^uM. 

O  HeiDSijs^rsfWstoi-^jiBdBthosa 
they  ara  brwr&n^ 

©  m/ormaijon  provided  on  iil«s  can 
fc«  n-<yea(JV^  reatBd  Jrto  dont  afways 


[f  AJera  can  improve  nte  jifo«m«tion 
It  *ot<W  be  ■Jietiil  if  duat  mforraation  -vrre 
supplied  for  reoomitiended  imkx  as  "■veil. 
Thcs  would  for  ttamplc.  iei  'jjcrs  kno™  • 
Site  s  ratine  anci  freshne-a  before  (hei-  ac- 
lualfy  accessed  that  site. 

Alttj  include;  a  chai  Lnierfaci  ihal  Jcis 
users  coramumcaie  in  real  Kme  «ilh  oth- 
er -Vlexa  users.  We  liked  Utit  a  -jscr  had  :o 
know  another  usert  addren  -.n  order  >o 
chM.  because  this  uved  ua  .'rom  being  both- 
ered oy  unwanted  dui  requests 

Ale«  s  archive  capabilny  feature  was 
not  fully  functional  m  the  bca  we  tested 
but  this  feature  will  eveniuallv  allow  users 
to  access  an  online  archive,  .-namiavntd  bv 
Ajeu,  of  pages  ihat  ar:  rither  cemporap- 
ly  not  available  or  have  been  disabled  ' 


bnwllnt  thr«agh  fldtfs 

5  lein  J.ii.i  li  jleaned  from  a  combinoticn 
of  collnbt^rativr  filtenn;  ind  jnjivltcal  .il- 
jondmn  ipplied  'o  frequent  Web  crawls. 
Cjil.nvntrve  fiitenng  lecHnoiogyis  ivp- 
■o.Tilv.  u.'^^d  vn  speci.Tli^red  rieclmoic  com- 
inc-.-j  '.I.e.-  such  33  Aiiiar.oncom  and  ^s 
i:i;c-j  •in  ihe  premise  ihai  'dsers  with  sim- 

lir  :iiicrc5t5  will  like  vhc  ijme  things. 
-■  r  c-..iinrlt;,  if  a  user  hked  a  ceri.im  '  cent  .if  i 
Sii.-.r^ln-n  Xing  novel.  Ihi;  W-.-ij  ;,(e  iniehi  ^  sur  itjl 
-.:0"ii-"'-nd  A  Clivi  3arn3r  book  -fiA'.  ,  lower  n 
J..-1-  ik-d  ^y  oihcr  users 'J.no  :nj(ivcd  ih:  ■  A  «ep 
:<;.i;  -^oc^-  input  w, 

C'■>ll.liv^,-atl^el'illennglec^nol'<-'^vl.^u^id   ;    die  prou 
-1  -...l1i..'s  fr.iiii  rircflvN'cv.vork  inc  .-lod  The  ■'' 


,Mjr>v  of  AJeta  i  unking CBpab^ities come 
frr5m  mcniionng  users  browsing  paths. 
*hlch  'v.ll  probably  ?et  oil  ;il.irms  .imong 
(hov;  ;oncar-n£d  about  pnvac->-  However, 
comoiin>  oiflciais  daim  ihjt  all  loforma- 
iion  iS  logged  jnonymously  and  ibey  have 
no  wnv  ic  determine  *hat  particalat  users 
ari  doing. 

When  it  cjme  lo  providing  basic  infor- 
maiirn  about  Ihe  site  wi  were  viewing, 
Alc-v.i  :.ime  jp  a  litUc  short  For  example. 
3t  ]1r!t  glance,  a  five  jtar  raiirvf  recmed  to 
implv  ihe  iiigh  quality  of  itv:  -xc  Howev- 
er, ive  diicoverid  that  the  itars  were  sim- 
plvmensurtjol  number  of  visitors,  and  did 
not  address  ^uoiily  at  all.  The  .op  10  per- 
cent ni  the  most  vi.tiied  siitrt  earned  five- 
SUr  itjiuj-.  sue?  "itn  fewer  -tsuors  h.id  a 


irate  quality  rating  based  on  user 
1  ^e  available  m  fuiurs  releases  of 
.'Ct.  Aietj  officials  s-nd. 
iCTi  bc!i  S  inform.iiion  .-in  server 
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13  Dovi/nload  of  the  week 
Alexa 

This  is  one  of  the  most  interesting  and  useful  Internet  tools 
weVe  ever  seen.  The  prerriise  IS  simpJe:  as  you  surf  the  Web,  a 
toolbar  appears  on  the  bottom  of  your  Web  browser.  This 
toolbar  contains  several  buttons,  each  leading  fo  Information 
about  the  site  you're  vterting.  You  can  check  a  site's  quality 
rating  (1  to  5),  the  speed  of  its  connection,  how  many  pages  it 
contains,  and  the  relative  "freshness "  of  its  material.  You  can 
also  find  out  who  owns  Mne  domain  nam©  for  each  site  you  visit, 
and  exactly  where  the  company  is  located-usefui  if  you  like  to 
dash  off  letters  to  your  favorite  sites'  owners,  but  can1  ever 
seem  to  find  their  email  addresses. 


But  the  highlight  of  Alexa  is  its  recommended  links  button.  For 
every  site  you  visit  Alexa  will  suggest  up  (o  ten  other  sites  with 
similar  subject  matter.  The  recommendations  are  sometimes 
way  off  base,  but  they're  far  more  intelligent  than  the  usual 
robot-generated  responses.  For  example,  a  visit  to  CNN  gives 
you  direct  links  to  competing  news  organizations;  a  trip  to  The 
Dilbert  Zone  can  send  you  down  a  pathway  to  other  extremely 
popular  newble  sites  like  the  White  Housq:  and  a  visit  to  a 
catalog  of  pom  sites  will-rather  surprisingly-give  you  lots  of 
wicked  suggestions. 

You  can  grab  a  beta  copy  of  Alexa  today.  You  need  a  486  or 
better  PC  mnning  Windows  95  or  Windows  NT.  and  Netscape 
Navigator  2.0  or  Internet  Explorer  3.0  (or  higher).  Just  click  the 
link  below  to  get  started; 

Download  Alexa 
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recording  technology  was  primitive.  "When  it  comes  to 
a  point  where  users  have  a  camcorder  recording  the  Net, 
then  the  archive  won't  be  worth  it,"  he  says. 

In  Building  116,  the  archive  is  stored  in  a  digital-tape 
library  that  looks  like  a  vending  machine.  The  tapes 
currently  have  the  capacity  to  hold  20  terabytes  of  data 
in  all,  about  as  much  information  as  is  in  the  Library  of 
Congress.  So  much  content  is  being  added  to  the 
Internet  that  the  archive  grows  by  about  a  terabyte  of 
data  each  month.  The  data-mining  computers  are  able  to 
adjust  their  site  visits  to  concentrate  on  those  that 
change  most  frequently.  They  will  come  upon  a  site, 
however,  only  if  Alexa  users  have  visited  it,  if  anyone 
else  on  the  Web  has  linked  to  it,  or  if  it  is  listed  with  a 
directory  service. 

While  the  archive  has  been  able  to  keep  up  with  textual 
information  on  the  Internet  ~  it  is  complete  from 
October  1 996  to  the  present  -  the  effort  to  collect 
images  is  running  a  few  months  behind. 

Through  Alexa,  the  archive  receives  about  14  requests 
for  old  pages  every  second.  Not  bad,  its  founder  says, 
when  one  considers  that  only  about  9,000  people  visit 
the  San  Francisco  Public  Library  on  an  average  day. 
Alexa  also  helps  researchers  by  listing  facts  about  the 
site  they're  visiting:  the  address  of  the  individual, 
company,  or  other  organization  that  owns  the  server  on 
which  the  site  is  located;  how  many  people  have  visited 
the  site;  how  frequently  the  site  is  updated;  how  fast  its 
computers  are;  and  how  many  pages  the  site  contains. 
Alexa  also  allows  users  to  vote  for  their  favorite  sites 
and  keeps  a  running  total  on  each  site. 

Mr.  Kahle  says  Alexa  does  not  keep  individual  statistics 
on  its  users.  Although  the  search  engine  tracks  the  paths 
of  users  as  they  jump  from  site  to  site,  it  does  not  record 
users'  names.  "We  don't  care  who  you  are,"  he  says. 
"We  just  care  what  path  you  take." 

Mr.  Kahle  dreams  that  Alexa  could  become  as  popular 
-  and  as  profitable  —  as  search  engines  like  Yahoo!  and 
Excite.  And  the  Internet  Archive,  he  says  with 
enthusiasm,  could  become  part  of  a  large  research 
library,  although  he's  not  sure  how.  "I  don't  think  about 
the  details,"  he  says.  "That's  why  we're  doing  something 
now  that  others  thought  was  impossible,  or  even  crazy." 
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least  some  of  the  top  sites,"  Mr.  Kahle  adds. 

Still,  one  needs  a  traditional  search  engine  or  a  specific 
Web  address  to  get  started,  Mr.  Kahle  acknowledges. 
And  Alexa,  unlike  the  Alta  Vista  search  engine  and 
others,  can  suggest  linking  only  to  entire  Web  sites,  not 
to  specific  pages  within  them. 

"I  don't  think  of  it  in  the  same  way  as  a  search  engine  -- 
it's  a  supplement,"  says  Bruce  Livett,  a  reader  and 
deputy  head  of  the  biochemistry  and  molecular-biology 
department  at  the  University  of  Melbourne,  in  Australia. 
"Alexa  gives  you  relevant  sites  in  the  general  sense, 
sites  that  you  sometimes  miss  because  other  search 
engines  depend  on  specific  keywords  you  enter." 

Dr.  Livett,  who  has  been  using  Alexa  since  October, 
surfs  the  Web  to  keep  up  with  the  research  work  of 
colleagues  around  the  world.  "It's  competitive  work,  and 
I  need  to  know  what  they're  doing."  Alexa,  he  says,  has 
alerted  him  to  research  sites  that  did  not  turn  up  in 
searches  using  Excite  and  Anzwers,  a  search  engine 
designed  for  Web  users  in  Australia  and  New  Zealand. 

Part  of  Alexa's  appeal,  he  says,  is  access  to  the  Internet 
Archive.  When  Alexa  users  get  a  dreaded  "404  -  file 
not  found"  error,  they  can  click  on  a  button  on  Alexa's 
tool  bar  and  pull  up  the  missing  page  from  the  archive. 
Using  the  archive,  Mr.  Livett  found  an  audio  interview 
he  needed  that  had  been  removed  from  a  Web  site. 

Alexa  is  "an  immediate  use  for  the  archive,"  says  Mr. 
Kahle,  adding  that  the  archive  is  the  component  that  he 
expects  will  eventually  separate  his  search  engine  from 
the  rest  of  the  pack.  Alexa  also  offers  a  direct  link  to  the 
Encyclopaedia  Britannica  Web  site,  allowing  users  to 
retrieve  reference  information  without  leaving  the  Web 
page  they  are  viewing  at  the  time. 

As  more  people  begin  to  use  Alexa  and  the  archive, 
however,  tricky  questions  about  copyright  and  privacy 
have  begun  to  crop  up.  An  e-mail  discussion  list  for 
Web  publishers  recently  included  a  heated  debate  about 
copyright  issues  surrounding  old  newspaper  articles  that 
are  part  of  the  archive. 

The  data-mining  computers  skip  Web  pages  that  require 
passwords,  as  well  as  Web  sites  protected  by  the 
Standard  for  Robot  Exclusion,  which  blocks  search 
engines  from  copying  pages  or  directories.  Still,  some 
Web  publishers  said  in  the  e-mail  discussion  that  Alexa 
officials  should  be  asking  on-line  newspapers  and 
journals  if  they  want  to  be  part  of  the  archive,  instead  of 
forcing  them  to  block  Alexa  from  copying  pages. 

How  the  archive  will  be  used  in  the  long  term  is  not 
clear.  Mr.  Kahle  often  mentions  the  early  days  of 
television,  when  programs  were  broadcast  live  and 
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For  example,  from  the  "Perseus  Project" 
(http://www.perseus.tufts.edu).  a  site  with  an  extensive 
collection  of  ancient  Greek  texts  in  translation,  Alexa 
points  the  user  to  sites  about  classicists  and 
Mediterranean  archaeologists  at  the  University  of 
Michigan,  sites  about  publishers  and  journals  available 
electronically,  sites  about  Hellenistic  linguistics,  and  to 
"Project  Gutenberg,"  an  Internet  producer  of  free 
electronic  texts. 

Since  October,  more  than  200,000  people  have 
downloaded  Alexa.  The  service,  which  Mr.  Kahle  hopes 
will  soon  be  fully  supported  by  advertising  revenue,  is 
not  yet  turning  a  profit.  But  by  the  end  of  the  year,  he 
expects  it  to  have  a  million  users.  Eventually,  he  hopes 
to  use  the  profits  from  Alexa  to  finance  the  gathering  of 
data  for  the  archive. 

The  advantage  of  Alexa  as  a  search  engine  is  that  it 
"attempts  to  be  an  objective  source"  for  people  seeking 
information.  Where  conventional  links  are  chosen  by  a 
page's  creator  according  to  what  the  creator  knows  and 
prefers,  Alexa  also  brings  other  Web  users'  knowledge 
and  preferences  to  bear.  The  sites  recommended  in  a 
given  search  sometimes  change,  depending  on  the 
surfing  patterns  of  Alexa  users. 

The  system  has  its  oddities.  If  users  frequently  traveled 
from  the  "Perseus  Project"  to,  say.  The  New  York  Times, 
the  newspaper  could  be  added  to  the  top- 10  list  of  an 
Alexa  user  looking  at  the  Perseus  site,  even  though  the 
only  thing  the  two  sites  have  in  common  is  their  users. 
In  fact,  such  a  situation  has  already  occurred.  From  the 
Perseus  site,  Alexa  suggests  —  based  on  other  users' 
habits  -  visits  to  the  sites  of  Franklin  and  Marshall 
College  and  Bates  College.  Alexa  officials  say  students 
at  the  two  colleges  probably  use  the  Perseus  site  in  their 
classes. 

"It's  sometimes  random  and  not  always  perfect,"  Mr. 
Kahle  says.  "But  if  researchers  use  a  traditional  search 
engine,  they  may  miss  some  of  the  best  sites."  A  search 
engine  such  as  Excite,  using  the  keywords  "Greek  texts" 
to  find  sites  related  to  the  "Perseus  Project,"  turned  up 
268,057  matches.  "With  Alexa,  you're  bound  to  hit  at 
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Just  as  the  Internet  has 
allowed  all  kinds  of 
ordinary  people  to  become 
their  own  publishers,  it  has 
allowed  a  computer 
scientist  named  Brewster 
Kahle  {left)  to  create  the 
non-profit  Internet  Archive. 

Mr.  Kahle  decided  to  save 
l^M    y  the  Internet's  contents  for 

^^.  9-  '/j     posterity  after  selling  his 

Tied  Mertz  foi  The  c  hronicie    prcvious  vcnturc,  the  Wide 
Brewster  Kahle  started  the        ^rea  Information  Server,  to 
non-profit  Internet  Archive       Amenca  Online  for 
after  selling  his  previous  $15-million.  The  system, 

venture,  the  Wide  Area  which  he  invented,  makes  it 

easier  to  search  electronic 
data  bases. 


Information  Server,  to 
America  Online  for 
$15-million. 


Why  an  archive?  "We  need  to  preserve  this  heritage," 
says  Mr.  Kahle,  an  affable  and  enthusiastic  37-year-old 
who  is  a  graduate  of  the  Massachusetts  Institute  of 
Technology.  "Or  one  day,  digital  anthropologists  will 
wonder  if  we  ever  learned  anything  from  the  history  of 
other  inventions.  Remember,  nobody  recorded 
television  in  the  early  days." 

Once  he  started  collecting  all  that  information,  Mr. 
Kahle  says,  he  realized  how  difficult  finding  things  on 
line  was  becoming,  with  the  number  of  Web  sites 
doubling  every  six  months  even  as  other  material  falls 
into  neglect.  So  he  set  about  creating  a  Web  search 
engine  using  the  technology  he  developed  to  manage 
the  massive  amounts  of  data  he  was  collecting  for  his 
quirky  history  project. 

The  result  is  Alexa,  a  search  engine  operated  by  Alexa 
Internet,  the  for-profit  company  that  is  part  of  the 
Internet  Archive.  "This  will  change  the  way  that 
researchers  use  the  Internet,"  Mr.  Kahle  says. 

Alexa  is  software  that  can  be  retrieved  free  from  the 
company's  Web  site  (http://www.alexa.com)  and  added 
to  a  Web  browser.  Unlike  other  search  engines,  such  as 
Yahoo!  and  Excite,  it  doesn't  rely  on  word  searches. 
Instead,  it  watches  where  its  users  go  on  the  Internet, 
and  then  records  that  infonnation  in  a  central  data  base. 
Based  on  that  information,  Alexa  can  tell  a  user  the 
most  popular  paths  that  other  Alexa  users  have  taken 
from  the  site  the  user  is  visiting  at  a  given  time. 

It  also  can  suggest  other  sites  offering  related  material. 
The  top  10  sites  pop  up  in  a  thin,  gray  bar  near  the 
browser  {see  below)  and  change  as  the  user  moves  from 
page  to  page. 
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In  Attempting  to  Archive  the  Entire 
Internet,  a  Scientist  Develops  a  New  Way  to 
Search  It 

Non-profit  group  uses  'data  mining'  in  effort  to 
preserve  World-Wide  Web  content 


By  JEFFREY  SELINGO 


SAN  FRANCISCO 


For  1 10  years,  Building  1 16  served  unobtrusively  as  a 
general  store  and  as  quarters  for  non-commissioned 
officers  stationed  at  the  Presidio,  the  U.S.  Army  base 
south  of  the  Golden  Gate.  Now  that  the  base  has  been 
decommissioned.  Building  1 16  stands  out. 

Its  red-tile  roof  and  cream-colored  shiplap  siding  have 
been  restored,  for  one  thing.  And  it's  one  of  the  few 
buildings  still  in  use  on  the  1,480-acre  property,  which 
became  a  national  recreation  area  in  1995.  What  makes 
Building  1 16  unique,  though,  is  what's  inside  -  a  virtual 
preservation  project  that  aims  to  create  a  complete 
archive  of  the  Internet. 

In  a  back  room,  a  buzz  comes  from  giant  computers  that 
are  "data  mining"  the  World-Wide  Web  and  Usenet 
discussion  groups,  finding  and  recording  pages  of  the 
computer  network  on  a  nearby  digital-tape  machine. 
The  computers  take  complete  snapshots  of  the  Web 
every  two  months,  allowing  users  to  find  pages  long 
after  their  owners  have  taken  them  down  and  let  their 
hyperlinks  lapse.  So  far,  the  archive  has  compiled  eight 
terabytes  of  data  —  the  equivalent  of  800,000  books  — 
and  has  recorded  at  least  three  snapshots  of  more  than 
500,000  Web  sites. 

Inside  those  virtual  books  is  the  patchwork  history  of 
ordinary  people:  pages  of  college  students  long 
graduated;  Web  sites  of  polifical  campaigns  since 
forgotten;  early,  awkward  versions  of  sites  that  are  now 
well-known;  infamous  sites  that  held  our  attention  for 
weeks,  such  as  the  Heaven's  Gate  cult's  page. 


ALSO  SEE: 

The  World-Wide  Weh  site 
of  the  Internet  Archive. 
and  a  new  Internet  search 
engine  that  can  be  used  to 
find  material  in  the 
Internet  Archive 
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Alexa  shows  the  way 

by    Christopher    Lindqijist 
(  8  /'  1  3  /•  9  7  ) 


uaaLSSEiBii 


Search  engines,  agents,  push 
fust  in      clients— everyone  wants  to  tell 
"'  you  where  to  go  on  the  Web. 

Unfortunately,  most  solutions 
spit  out  a  dry  list  of  links  based 
on  keywords.  Alexa  Internet's 
Alexa  service  takes  a  more  democratic 
approach.  And  it's  more  than  just  a  link  lister; 
it's  a  full-blown  information  service  that  gives 
you  the  skinny  on  nearly  every  site  you  visit. 

The  Alexa  toolbar  floats  on  your  screen.  As 
you  surf,  the  program  retrieves  a  variety  of 
site  information  from  Alexa  Internet's  servers. 
Go  to  any  site,  and  Alexa  reports  the  name  of 
the  company  that  owns  the  domain  name,  the 
server's  relative  speed,  and  how  many  pages 
exist  on  the  site. 

You  also  get  a  list  of  alternative  site  links. 
Rather  than  simply  querying  a  search  engine, 
Alexa  invokes  a  variety  of  methods  to  guide 
users.  Alexa's  servers  constantly  crawl  the 
Web  and  search  page  text  to  find  similar 
content.  The  servers  also  analyze  patterns  of 
where  people  travel  on  the  Net.  Add  a  site  to 
your  link  list,  and  Alexa  notes  the  new  site 
and  gives  it  a  high  rating.  If  enough  people 
add  the  site,  it  may  eventually  get  onto  the 
toolbar's  default  site  list.  The  folks  at  Alexa 
Internet  can  also  manually  insert  sites  into  a 
list  to  bring  interesting  new  locations  to  the 
front.  Finally,  Alexa  sells  advertising  space  on 
its  lists  in  the  form  of  tiny  color  banners  that 
also  act  as  links. 

Alexa's  unique  search  approach  often 
produces  interesting  results.  For  instance,  a 
quick  AltaVista  search  on  woodworking  called 
up  the  usual  unfathomable  62,000  listings. 
However,  after  we  made  a  quick  visit  to  a 
favorite  woodworking  hobbyist  site,  Alexa 
produced  links  to  four  similar—and 
useful-pages  we  had  never  seen  before. 
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That  said,  things  don't  always  work  as 
expected.  Pages  focused  on  words  with 
muhiple  meanings  can  produce  odd  results 
(for  instance,  a  church  mission  link  attached  to 
a  site  touting  mission-style  furniture).  Some 
smaller,  less- frequented  sites  don't  produce 
any  links  at  all.  And  Alexa's  algorithms  for 
determining  a  site's  average  freshness  of 
content  need  some  tweaking  (they're  biased 
against  large  sites). 

Once  you  learn  to  navigate  the  sometimes 
cryptic  toolbar  interface,  you'll  be  able  to 
contact  Alexa's  tech  support,  enter  a  real-time 
chat  with  other  Alexa  users  (though  chat 
features  were  rather  limited  in  our  beta 
software),  check  a  site's  popularity,  rate  sites 
with  a  thumbs  up  or  down,  and  dig  into 
financial  info  about  a  site's  parent  company.  If 
the  page  you  want  no  longer  exists  on  its 
original  server,  you  may  be  able  to  retrieve  the 
page  from  Alexa's  extensive  archives. 

If  you've  been  looking  for  a  tool  that  helps 
provide  some  context  for  the  sometimes 
chaotic  Web,  give  Alexa  a  try. 


(A  limited  beta  is  available  now.  The  final 
version  is  due  at  the  end  of  this  summer.) 
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Alexa 

Direct  price:  free 

Alexa  Internet,  415/561-6900 

P°  486,  3MB  disk  space,  Windows  95  or  Windows  NT,  Navigator 
2.0  or  Internet  Explorer  3.0 
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INSIDER  PROFILE: 
Brewster  Kahle 


»reaffTntervie\ 
»hear  intervie\ 


JQO 

Internet  Archivist, 
Entrepreneur 

Home 

Lives  in  one  of  30 

officers'  homes,  now 

part  of  Presidio 

National  Park,  in  San 

IFrancisco. 

Education 

IB.S.  in  Connputer 

IScience,  Massachusetts  Institute  of  Technology, 

1982 


Most 


accor 


Archived  the  Web  on  2  terabytes  of  storage, 
devised  systenn  to  navigate  Web  following  the 
trails  of  where  others  have  gone. 

Tra^;;^  R;:;;CO>"u 

Built  Connection  Machines  at  Thinking  Machines 
until  1992.  Founded  WAIS  Inc.  in  1992.  Sold 
IWAIS  to  Annerica  OnLine  in  1995  for  $15  million. 
In  1996  left  WAIS  to  form  Alexa  Internet  and  the 
Internet  Archive. 


'I  think  the  technology  we  are  building  is  really 
Itrying  to  interconnect  people  better.  If  people 
communicate  better  and  faster,  you  can  make  the 
whole  organism  smarter.  That's  what  we're 
shooting  for." 


"It  takes  longer  than  you  can  possibly  imagine." 


Married,  two  sons. 
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Net  Surf:  Alexa's  New  Navigation 
Service 

by  Joey  Anuff 

6:06pm  1  Aug.97.PDT  "Net  Surfing,"  as  a  concept, 
is  an  anachronism  -  a  throwback  to  the  days 
when  the  Web  was  all  about  motion,  and  rarely 
about  rest.  In  the  days  when  every  page, 
personal  or  corporate,  was  a  "homepage,"  the 
common  denominator  was  the  hotlist,  and  every 
Web  site  felt  compelled  to  carve  its  identity  via  a 
list  of  links.  It  made  sense  -  before 
server-pushes,  Java,  and  stylesheets,  the  most 
sophisticated  Web  technology  was  the  href. 

It  still  is.  Too  much  so,  in  fact.  By  now,  almost 
everybody  realizes  that  traffic  is  the  true  global 
currency.  Outside  of  bona  fide  homepages  and 
Yahoo,  most  off-site  links  are  ads,  paid  for  by 
people  who  need  more  traffic  in  order  to  sell 
more  traffic.  The  surreal  mathematics  of 
trafficking  in  traffic  may  become  better 
understood  and  standardized,  but  they  won't  go 
away.  Which  creates  a  huge  opportunity,  a 
chance  to  reintroduce  the  connectedness  the 
Web  was  originally  created  to  enable.  Thus  Alexa 
-  a  new  navigation  service  conceived  by 
Brewster  Kahle,  inventor  of  WAIS  and 
co-founder  of  Thinking  Machines. 

Ironically,  while  reminiscent  of  quite  a  few 
collaborative  agent  filtering  sites,  search 
engines,  and  bookmark-management  schemes, 
Alexa  is  neither  a  Web  site  nor  a  plug-in,  but  an 
application  that  works  in  cooperation  with  your 
browser.  It  functions  as  a  toolbar  that  presents 
an  active  analysis  of  every  site  you  visit  -  who 
owns  it,  how  many  sites  link  to  it,  how  many 
pages  are  on  it,  and  how  popular  it  is  amongst 
other  Alexa  users.  More  importantly,  it  provides 
multiple  suggestions  for  where  to  go  next  - 
based  on  which  sites  are  similar,  which  sites 
others  have  fled  to,  and  which  sites  others  have 
linked  it  to.  And,  of  course,  who  has  paid  Alexa 
to  be  included. 

But  don't  fault  them  for  the  slight  intrusion  of 
commercialism.  That's  a  given.  What's  more 
alarming  is  the  fact  that  the  ploy  works,  more  or 
less.  Some  might  be  impressed  by  its 
instant-messaging  add-on  or  its  archive  service, 
which  strives  to  eliminate  404s  forever,  but  these 
are  merely  equivalent  to  leather  interiors  and  a 
rear  spoiler,  entirely  tangential  to  the  quality  of 
the  ride.  Even  in  its  infancy,  with  its  current  surf 
suggestions  based  not  on  user  feedback  but  on 
more  mundane  technical  comparisons,  Alexa's 
pointers  often  make  no  sense  at  all,  but  are 
precise  and  impressive.  It  all  adds  up  to  a 
two-part  puzzle:  Will  it  take  an  application  distinct 
from  the  browser  itself  to  bring  back  the  notion  of 
motion?  And  if  so,  does  anybody  still  want  to 
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Alexa  gets  you  where  you  want 
to  go 

Navigation  service  alleviates  search  engine 
roulette 


It's  no  secrec  that  search  engines  are  noiv  one  of  the  weaker 
links  on  the  Web-  When  Web  users  go  to  <.i  search  site,  they 
usually  expect  plenty  of  duplicate  URLs,  irrelevant  sites  and 
links  to  pages  that  vanished  weeks  or  even  rnonths  ago. 
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Alexa  Internet,  a  S.in  Francisco- 
based  stari-up,  is  offering  a 
product  that  may  provide  a  real 
alternative  to  search  engine 
roulette.  The  company's  Alexa 
navigation  service  is,  in  equal 
pans,  a  search  engine,  a  direc- 
rory  sen/ice  a  la  Yahoo,  an 
archive,  and  a  collaborative  fii- 
tering  system.  The  resulting 
combination  is  a  novel — and  ef- 
fective— tool  for  navigaring  the 
Web. 


The.Ale:i;a  sep.'ice  consists  of  a 
client  application  that  appears 
as  a  toolbar  at  the  bottom  of  a 
user's  display  When  the  user 
goes  to  a  Web  site — a  travel 
agency,  for  evample— the  client 
simultaneously  sends  the  URX.  to 
the.-Uexa  server,  which  suggests 
other  Web  sites  that  offer  re- 
lated content.  Even  if  the  initial 
link  is  Same  and  obscure  [try  en- 
tering 'trc-vel  agency"  in  Alta 
Vista  and  see  vvbat  you  geti). 


Alexa  can  point  to  higher  qu^'' 
ity  sites— Travelocity,  perhaps, 
or  TravelWeb.  If  the  user  clicks 
through  to  a  related  but  some- 
what different  sice,  such  as  a 
travel  firm  that  deals  specifi- 
cailv  with  cruises.  Alexa  vvill 
note  (he  distinction  and  display 
updated  links  that  match  the 
user's  interests. 

Ale.\a  vvorki.  first,  by  regularly 
crawling  and  archiving  the  en- 
tire public  Web  [about  two 
terabytes  of  data)  at  the  rate  of 
around  a  terabyte  per  month. 
The  service,  which  has  been  cer- 
tified by  the  nonprofit  TRUSTe 
(formerly  knoxvn  as  e-Trusc)  ini- 
native,  honors  the  robots.txt 
protocol  and  will  ignore  sites  at 
(he  owners  request.  Ale.\a  also 
regularly  turns  over  its  archives 
to  the  Internee  Archive,  a  non- 
prorit  organization  dedicated  to 
preserving  a  series  of  historical 
'snapshots"  of  the  Web. 

While  .-Uexa  archives  the  Web. 
the  system  uses  several  tech- 
niques to  analyze  sites.  First. 
Alexa  uses  neural-net  technol- 
ogy CO  pe.-form  a  te.tt  analysis  of 
sites.  looking  for  pages  and  sites 
with  similar  characteristics.  The 
system  also  analyzes  link  srruc- 
tures.  detecting  patterns  that 
point  to  paniculatiy  common  or 
popular  sites.  The  idea  here  is  to 
exploit  the  expenise  built  into 
the  thousands  of  link  lists  and 
"favorite  sites"  pages,  in  almost 
every  conceivable  category,  al- 
ready available  on  che  Web.  In 
addition,  the  system  makes  cre- 
ative use  of  public  databases  to 
find  companies  operating  in  the 
same  business  category— and 
that  might  have  similar  Web 
sices. 

Alexa  also  includes  data  from 
the  anonymized  surfing  habits 
of  its  users:  which  sites  they  visit, 
how  deep  they  go,  how  long  they 
linger  and  where  they  go  next. 
Users  can  also  explicitly  state  . 
cheir    opinions    by   voting   on  , 
whether  or  not  they  found  a  ■ 
given  site  helpful.  ! 

The  system  offers  some  other 
interestmg    features.    Because 
Alexa  archives  Web  content,  us-   ■ 
ers    who    click    through    to    a 
'dead"    page    can    receive    an  ; 


archived  copy  of  the  page  in- 
stead of  the  usual  "404 — ^N'ot 
Found"  error  message.  The 
toolbar  can  also  provide  infor- 
mation on  who  owns  a  site,  ho-.v 
many  pages  are  on  the  site,  how 
many  other  Web  sites  point  to 
the  site,  how  frequently  it  is  up- 
dated and  how  popular  it  is 
among  other  .Aile.xa  users.  Fi- 
nally, Alexa  will  show  which 
other  users  are  online  and  allow 
real-time  communication  vnth 
them. 

Ads  that  stJck.  .As  always,  the  key 
question  is  "where  s  the 
money?"  .-Me^a  is  using  an  ad- 
based  business  model,  placing 
the  ads  on  the  toolbar  and  on  an 
extended  pulldown  list  of  rec- 
ommended sues.  The  ads  are 
context-sensitive,  so  that  users 
looking  for  char  perfect  travel 
site  might  see  a  Travelociry  ad — 
even  whiie  they  browse  through 
competing  sites. 

While  we  wouldn't  hazard  a 
guess  on  whether  this  mriant  of 
Web  advertising  wiU  fly  when  so 
many  others  have  crashed  and 
burned,  it  is  certainly  interest- 
ing. By  keeping  rhe  ad  on  the 
.-Me.xa  toolbar,  the  service  can 
keep  an  advertiser  visible  across 
a  series  of  Web  sites,  something 
that  even  che  biggest  site-based 
ad  networks  cant  promise  yet. 
.AJexa  places  ads  only  on  desig- 
nated areas  of  the  toolbar,  keep- 
ing plenry  of  free  space  for  the 
system's  objective  choices. 

Pn-v-acY  issues  might  also  be  a 
concern  here,  but  in  this  case 
Alexa  has  done  everything  Pght 
The  company  guarantees  that 
data  from  users  will  not  be  di- 
vulged TO  other  firms.  Alexa  has 
also  secured  an  endorsement 
from  TRUSTe.  a  nonprofit  con- 
sortium organized  by  the  Elec- 
tronic Frontier  Foundation  and 
CommerceNet,  which  monitors 
and  certifies  cites  that  adhere  to 
its  privacy  guidelines.  Users 
seem  to  appridate  the  effort;  ac- 
cording to  Z  Smith,  the  com- 
pany's V?  of  engineering,  more 
than  90"?%  of  the  beta  users  have 
sent  their  E-mail  addresses  to 
use  Ale:ia'3  instant  mesisging 
feature   Initially,  the  company 
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had   predicred   rhac   only   30"' 
would  do  30. 

We  had  an  opportunity  to  se- 
the  AJexa  system  demonstrate  ' 
and  then  to  cr>'  it  ourselves.  Th 
resuirs  were  impressive;  Alex 
was  able  :o   provide   rclevar 
links  from  iome  prettv  obscur 
sites,  and  icdid  a  good  job  ofta. 
loring  its  recommendations  s 
we  surfed  through   sites  wit 
somewhat    different    types    f  .- 
contenc.  We  did  encounter  son  ^ 
bugs,  such  as  ihe  occasional  oi  - 
topic     recommendation      li-:. 
AJexa  is  aJso  rather  slow  at  del; .  - 
ering  pages  out  of  its  archive,  a 
process  that  can  cake  as  long  is 
15  mmutes.  Pan  of  this  is  Inevi- 
table, since  the  system  keeps 
90%  o!  its  data  on  tape  robccs. 
Part  of  ic  is  also  due  to  a  roL;.^h 
caching  modei  for  what  goes  ;n 
the  system's  disk  cache,  wh   re 
the  other  lO'^o  of  the  data  is  avail- 
able for  higher  speed  access. 
Given  the  system  s  impressive 
showing  at  such  an  early  stage  in 
its  development,  however,  we 
believe  it  will  get  even  better  as 
larget  numbers  of  users  come 
online  and  .Alexa  continues  to   : 
refine  its  technology. 

Alexa  isn't  going  to  replace 
traditional  search  engines, 
which  will  remain  the  best  op- 
tion for  users  who  have  a  very 
specific  query  and  a  good  idea  of 
what  they  e.xpect  to  get  in  return. 
.Nor  will   it   replace   directory- 

The  5«yaoiO  fteoort  on  Internet  PuDllshlng 


based  services  such  as  Yahoo, 
which  can  offer  more  complete 
hsts  of  options  organized  in  a 
distinct  taxonomy.  .Aiexa  does, 
however,  provide  a  convincing 
alternative  for  more  general 
searches  and  conte.xt-based 
"where  ne.>;c?"  queries,  just  the 
kinds  of  searches  that  make 
search  engines  so  frustrating 
and  pamful  to  use.  Alexa  is,  in 
short,  a  great  idea:  we  hope  the 
company  can  combine  that  idea 
with  effective  marketing  and  a 
sustainable  business  model. 

Alexa  is  currently  taking  appli- 
cations for  its  beta  program, 
which  will  run  through  the  sum- 
mer. According  to  Smith,  the 
company  hopes  to  have  the  sys- 
tem ready  for  general  use  in  a 
couple  of  months. 

Marr  McKenzie 
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Compilation  of  Alexa  Press  Coverage 


As  of  September  3,  1997,  Alexa  has  received  coverage  in  the  media: 


Headline: 

Publication: 

Date: 

Author: 

Alexa' s  Theory  of 
Relativity 

PCWeek 

8/18 

Jim  Rapoza 

Cache  This 

Upside 

8/18 

Tish  Williams 

Download  of  the  Week 

C/Net  TV  -  The 
Web 

8/18 

C/Net 

Alexa  Internet 

Network  News: 
Information  for 
Business  and 
Computer 
Applications 

8/17 

Alan  Gathn 

FYI:  Popularity  Context 

Star  Tribune 

(Twin 

Cities/Minnesota) 

8/4 

New  York  Times 

Scent  of  a  Web  site 

Inman  News 
Features 

8/4 

Inman  News 

The  Web  HotList 

InfoWorld 

8/4 

Jim  Battey 

Internet  boom  just 
beginning,  conference  told 

Reporter 
(Vacaville,  CA) 

8/3 

San  Francisco  Chronicle 

Internet  mdustry  'not  even 
close  to  peaking' 

The  Midland 
Reporter- 
Telegram 
(Midland,  Tx) 

8/2 

Jon  Swartz 

Net  Surf:   Alexa' s  New 
Navigation  Service 

Wired  News 

8/1 

Joey  Anuff 

Smart  Searching  at 
Internet  Wodd  (Web 
Addict  Section) 

Web  Review 

8/1 

Wes  Thomas 

Site  Reviews:  Alexa 
Internet 

Media  Central 
Digest 

7/29 

Sean  Wolfe 

Alexa:   Merging  Web 
Searching  and  Group 
Experiences 

Internet  Outlook 
(http://www. 
webreference, 
com) 

7/28 

Richard  Wiggins 

Alexa  Keeps  Surfers  on 
Beaten  Path 

Inter@ctive  Week 
(print) 

7/28 

Tom  Steinert-Threlkeld 

Alexa  Searches  While  You 

PC  Week 

7/28 

Jim  Kerstetter 
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Headline: 

Publication: 

Date: 

Author: 

Surf 

Blacks  Say  Internet  Otters 
Equality 

The  Plain  Dealer 
(Cleveland,  OH) 

7/28 

From  staff  and  wire 
reports 

Officials  See  No  End  in 
Sight  for  the  Popularity  of 
the  Internet 

Post-Bulletin 
(Rochester,  MN) 

7/28 

Jon  Swartz  (SF 
Chronicle) 

New  Service  Tracks  Web 

Use 

Plainview  Herald 

7/28 

New  York  Times  Service 

Revolution  Brewing  at 
Presidio:    Alexa  Internet 
hopes  to  change  the  way 
you  use  the  Net 

San  Francisco 
Examiner 

7/27 

Zachary  Coile 

Net  Growth  Virtually 
Exploding 

Austin  American 
-  Statesman 
(Austin,  TX) 

7/26 

Jon  Swartz  (SF 
Chronicle) 

The  Internet  Boom  Is  Just 
Beginning:    Big  business  is 
the  next  frontier,  attendees 
at  Chicago  trade  show  say 

San  Francisco 
Chronicle 

7/25 

Jon  Swartz 

Getting  Even  More 
Personal... 

Interactive 
Services  Report 

7/25 

No  byline 

E-commerce  about  to 
launch? 

The  Middletown 
Press 
(Middletown,  CT) 

7/25 

Jon  Swartz  (SF 
Chronicle) 

Outside  The  Box 

San  Jose  Mercury 

News 

7/24 

Chris  Nolan 

As  Go  Surfers,  So  Goes 
Alexa 

Wired  News 

7/24 

Chns  Oakes 

Internet  Industry  Giants 
Snub  Expo 

The  Desert  Sun 
(Palm  Springs, 
CA) 

7/24 

Jon  Swartz  (SF 
Chronicle) 

2  Internet  Giants  Won't 
Show  at  Expo:   Microsoft, 
Netscape  are  skipping 
Chicago 

San  Francisco 
Chronicle 

7/23 

Jon  Swartz 

New  Ad-Backed  Service  to 
Monitor  Context  of  Web 
Pages 

Internet 

Advertising 

Report 

7/23 

Compiled  by  Beth  Cox 

Service  Turns  Page  in 
Aiding  Web  Users 

The  Oakland 
Press  (Pontiac, 
MI) 

7/23 

New  York  Times  News 
Service 

New  Guide  Offers  Net 
Directions 

Inter@ctive  Week 
(online) 

7/22 

Tom  Steinert-Threlkeld 
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1                Headline:                      Publication:           Date:                  Autiior: 

New  Search  Tool  Tracks 
User's  Path  Through 
Internet 

The  San  Diego 
Union-Tribune 

7/22 

New  York  Times  News 
Service 

News. Scan:    Today's 
Headlines  from  the 
Mainstream  Press 

PC  Magazine 
Online 

7/22 

NewsWatch 

Looking  for  a  Second 
Commercial  Hit,  Brewster 
Kahle  Launches  a 
Navigation  Service 

Web  Week 

7/21 

Margaret  McKegney 

Alexa  Internet  Lands  at  the 
Presidio 

San  Francisco 
Business  Times 

7/21 

Steve  Ginsberg 

New  Service  Tracks  Web 

Use 

Nando.net 
(Reprint  of  NY 
Times  Article) 

7/21 

John  Markoff 

New  Service  Tracks  Web 
Use 

New  York  Times 

7/21 

John  Markoff 

Alexa  Internet:    The 
Search  as  a  Communal 
Effort 

New  York  Times 

(CyberTimes 

Extra) 

7/19 

Laurie  J.  Flynn 

Recent  &  Decent  Web 
Tools 

Family  PC  News 
Online 

7/18 

Michelle  Megna 

Alexa  Browser  Companion 
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Alexa  Searches  So  You  Don't  Have  To 


by  Lisa  Moskowitz 

September  24.  1997 

What  if  you  didn't  have  to  rely  on  keywords  or  search  engines 
to  pinpoint  information  on  the  Web?  Say  hello  to  Alexa  1.0,  a 
free  Web  navigation  service  that  works  with  your  browser  to 
guide  you  through  the  ever-growing  maze  of  pages  that  is  the  Web 


Alexa  is  a  gray  toolbar  that  floats  on  your  screen  or  neatly  anchors  beneath  the 
browser  window.  When  you  land  on  a  site,  a  pop-up  box  on  the  toolbar  gives 
you  a  list  of  up  to  ten  related  sites  you  might  be  interested  in  visiting.  For 
example,  if  you  to  go  to  CNN.com,  Alexa  will  suggest  the  MTV,  New  York 
Times,  ESPN  Sportszone,  or  Yahoo  sites  as  possible  next  Web  destinations. 
These  "Where  To  Go  Next?"  listings  are  based  on  Alexa's  searches  for  related 
content  and  by  the  surf  patterns  of  other  Internet  users.  Sometimes  the 
suggestions  are  right  on  target.  Other  times  they  appear  to  be  totally  random. 

In  addition  to  the  "Where  To  Go  Next  :*"  feature,  Alexa  also  has  a  "Where  Am 
I?"  pop-up  box  that  gives  you  data  on  a  particular  site.  For  instance,  you  can  find 
out  who  owns  the  site,  how  many  pages  it  has,  its  popularity  among  all  Web 
users  (based  on  the  number  of  hits  the  page  has  received),  how  often  its  pages 
are  updated,  and  how  quickly  they  load.  Also  on  the  toolbar,  you'll  find  online 
help  and  a  link  to  the  Alexa  site.  You  can  vote  on  the  site  you're  currently 
visiting,  send  messages  to  other  Alexa  users,  and  retrieve  pages  from  the  Alexa 
Archive  if  you're  looking  for  an  older  version  of  a  page.  A  new  feature  in  the 
final  version  is  access  to  Encyclopaedia  Britannica  online.  Abstracted  entries 
are  available,  as  are  a  Webster's  dictionary  and  thesaurus. 

Although  Alexa  is  a  useful  tool  that  helps  narrow  your  Web  search,  it  does  have 
its  drawbacks.  Toolbar  icons  aren't  labeled  and  you  have  to  put  your  mouse  over 
them  to  reveal  their  functions.  Less  popular  sites  often  don't  have  links,  while 
the  suggested  links  at  some  sites  don't  divulge  the  slightest  clue  as  to  what's  on  a 
page.  Other  links  seem  to  work  off  of  a  site's  server  rather  than  its  content.  For 
example,  I  looked  at  a  site  about  Chinese  pug  dogs  on  a  server  at  the  University    \ 
of  California  at  Davis.  Instead  of  guiding  me  to  other  sites  about  pugs  or  dogs  in-'^' 
general,  Alexa  zeroed  in  on  the  host  server  and  suggested  I  visit  environmental 
information  sites  next. 

So  it's  not  perfect,  but  Alexa  is  a  pretty  good  tool  for  surfing  the  Web  without 
having  to  jump  back  and  forth  between  a  site  and  a  search  engine.  And  even  if 
the  suggested  links  don't  cough  up  the  exact  facts  you  went  online  to  find,  you'll 
end  up  exploring  Web  sites  you  might  never  have  known  existed.  What  better 
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way  lo  uiscover  me  irue  oreaum  oi  iniormaiion  avaiiaoie  on  me  weo.' 

Alexa  works  with  Internet  Explorer  3.0  or  higher,  Netscape  Navigator  2.0  or 
higher,  and  Netscape  Communicator  4.0.  You  can  download  it  now  by  clickin* 
on  the  link  to  your  right. 
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Alexa  accessorizes 
browsers 

By  Alex  Lash 

September  22,  1997,  5:00  p.m.  PT 

Alexa  Internet  released  today  the  first 
commercial  version  of  its  reference  and 
navigation  tool  that  plugs  into  a  browser  and 
provides  background  information  on  Web  sites. 

Once  downloaded  and  installed,  the  Alexa 
software,  which  is  free  and  appears  as  a  toolbar 
at  the  bottom  of  a  user's  browser,  monitors  a 
user's  surfing  to  display  statistics  for  Web  sites, 
suggest  related  sites,  and  offer  quick  links  to 
online  reference  material. 

With  the  glut  of  information  on  the  Web,  not  to 
mention  a  glut  of  search  engines  and  directories. 
Alexa's  in-browser  tool  could  come  in  handy  for 
those  who  don't  want  to  click  through  to  another 
site  for  ancillary  information. 

Alexa  sees  its  product  as  similar  to  that  of 
Firefly  Network  and  others  that  seek  out  users' 
potential  preferences,  as  well  as  search  engines 
and  directories  such  as  Yahoo  and  Excite. 

"The  advantage  of  what  we  do  compared  to 
Yahoo,  for  example,  is  that  we  stay  with  you  as 
a  dashboard  wherever  you  go,"  said  Alexa 
spokeswoman  Cynthia  Lohr.  "Search  engines 
currently  don't  do  that." 

A  button  on  the  toolbar  links  the  user  to 
thumbnail  guides  of  the  Encyclopaedia 
Brittanica,  Webster's  Dictionary,  and  Webster's 
Thesaurus. 

Alexa  uses  intelligent  agents,  also  known  as 
"bots"  or  "crawlers,"  to  gather  and  display 
information  about  each  Web  site  the  user  visits. 
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The  Alexa  toolbar  instantly  displays  the 
following  information  about  each  site:  who  has 
registered  the  site;  the  number  of  pages  on  the 
site;  how  many  other  sites  point  to  this  site;  how 
frequently  the  site  is  updated;  and  the  site's 
popularity. 

Alexa  also  suggests  relevant  links  to  each  site  by 
combining  its  users'  feedback  with  the 
navigation  paths  of  previous  visitors  to  the  site. 
The  company  hopes  to  garner  revenue  by  selling 
space  in  the  links'  windows  to  advertisers, 
similar  to  the  method  search  engines  such  as 
Yahoo  use  to  match  banner  ads  to  users' 
keyword  searches.  The  company  will  not  sell  the 
recommended  links  themselves,  a  spokeswoman 
said. 
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Because  Alexa  is  the  commercial  arm  of  the 
Internet  Archive,  it  taps  that  resource  to  display 
Web  pages  that  are  no  longer  live.  Instead  of 
fetching  a  "not  found"  error  message,  for 
example,  the  end  user  will  go  to  the  archived 
version  of  an  old  page.  The  nonprofit  Internet 
Archive  project  seeks  to  store  for  posterity  Web 
pages,  Usenet  discussions,  and 
shareware-anything  Net-related  that  will  be  lost 
to  history.  Alexa  gives  back  by  adding  to  the 
archive  any  new  sites  its  users  come  across. 

Alexa  users  who  know  each  others'  email 
addresses  can  check  to  see  who  else  is  online. 
Without  switching  to  a  separate  email  client, 
those  online  can  send  each  other  instant  text 
messages,  which  pop  up  from  the  toolbar. 

The  software  currently  runs  on  Windows  95  and 
NT.  The  company  says  it  will  have  Macintosh, 
Windows  3.1,  and  Unix  versions  in  the 
unspecified  future. 
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Tracking  the  Web's  beaten  paths 

His  archive  helps  Brewster  Kahle  point  the  way  through  cyberspace 


Bv  Eli/abelh  Weise 
USA  TODAY 

SAN  FRANCISCO  -  When 
he  really  gets  going  about  his 
ideas,  such  as  archiving  the  en- 
tire Internet  or  mapping  the 
Web  by  looking  at  the  trails  us- 
ers leave,  Brewster  Kahle  can 
sound  like  Robin  Williams  —  if 
Williams  had  gone  to  MIT. 

Kahle  hunches  down  beside 
a  Coke  machine-size  box  full  of 
computer  tapes  that  together 
hold  the  equivalent  of  about 
SO.non  books  His  hands  leap 
into  the  air,  his  voice  shifts  oc- 
taves, and  suddenly  he's  two 
supercomputers  trying  to  find 
a  speed  at  which  I  hey  can  talk. 

"  Tlnw's  (his  for  you?'  'Fast- 
er, lots  faster.'  'Fine,  how's 
this?'  'No  noise  on  this  end.' 
'Great,  I'll  start  shipping  data 
now.'  'Fantastic.  I'm  ready!' " 

Described  by  Microsoft's 
chief  technical  officer  Nathan 
Myhrvold  as  "a  crazed  lunatic, 
brilliant  visionary  and  nice  guy 
all  rolled  into  one,"  Kahle  has 
always  thought  big 

In  the  early  198(ts,  he  was  a 
scientist  at  a  company  called 
Thinking  Machines,  where  he 
designed  "massively  parallel" 
computers,  powerful  machines 
made  up  of  100,000  .small  com- 
puters connected  by  a  fast  net- 
work. They  broke  huge  prob- 
lems into  small  bits  that  could 
be  solved  simultaneously. 

Now  Kahle,  .36,  is  turning  his 
expertise  in  the  analysis  of 
really  big  amounts  of  data  to 
the  problem  of  finding  things 
on  line  His  new  company, 
Alexa  Internet,  is  a  World  Wide 
Web  navigation  service  that 
gives  users  information  about 
where  they  are  and  also  rec- 
ommends where  to  go  next. 

At  his  offices  in  a  renovated 
Victorian  general  store  in  the 
former  Presidio  Army  base 
here,  Kahle  talks  a  visitor 
through  the  service,  his  blue 
eyes  glowing  with  pride  under 
a  cloud  of  curly  blond  hair. 

Like  a  trained  tracker,  Alexa 
(http;//www.alexa.com)  helps 
guide  users  roaming  the  Web. 
Named  for  the  lost  library  in 
ancient  Alexandria,  it  creates 
a  thin  toolbar  that  shows  a  con- 
stant stream  of  information: 
who  registered  the  site  you're 
at,  how  often  it's  updated,  how 


His  Internet  preservation  act 


Brewster  Kahle  founded 
the  nonprofit  Internet  Ar- 
chive in  1996  to  record  the 
cacophony  of  human  voices 
being  con.stantly  created  — 
and  deleted  —  on  line. 

"The  Net  isn't  10  chan- 
nels of  TV.  It's  something 
fundamentally  different 
and  worth  preserving." 
Whether  a  publisher  hated 
your  manuscript  or  democ- 
racy failed  to  take  root  in 
your  country,  "the  Net  is  the 
answer, "  he  says.  The  ques- 
tion was  how  to  make  sure 
historians  would  have  ac- 
cess to  it. 

Using  "Web  crawling" 
programs,  which  copy  ev- 
erything they  find  and  send 
it  back  to  the  archive's  serv- 


er in  .San  Francisco,  Kahle 
makes  a  full  copy  of  the 
Web  and  Usenet  discussion 
groups  every  six  months. 

The  archive  is  stored  on 
.•iOO  tapes,  each  of  which 
holds  SO  gigabytes,  the 
equivalent  of  about  S0,000 
hooks  Kahle  estimates  that 
a  copy  of  the  Web  at  its  cur- 
rent size  will  take  about  4 
trillion  bytes  of  data  —  4 
terabytes,  or  the  equivalent 
of  a  good-size  urban  library. 
The  Library  of  Congress 
holds  about  20  terabytes  of 
data. 

Copies  of  all  the  tapes  are 
also  being  stored  in  Seattle, 
where  Kahle  hopes  eventu- 
ally to  build  a  think  tank 
centered  on  the  archives. 


many  pages  it  contains. 

But  it's  as  a  guide  that  Alexa 
shines.  Using  anonymous  data 
from  Net  traffic  nodes,  the  ser- 
vice sees  what  paths  others 
have  taken  and  offers  them  to 


users  as  a  small  pop-up  list, 
ranked  by  which  links  were 
most  heavily  traveled. 

"It's  a  sort  of  chatty  naviga- 
tor that  in  some  metaphorical 
sense  has  talked  with  a  lot  of 


Py  JuIW  S1upsl<er.  AP,  for  USA  TOOAr 

Web  visionary:  Brewster  Kahle, 
36,  v^ltti  the  collection  of  500 
computer  tapes  that  hold  a  re- 
cent copy  of  the  entire  Web,  the 
equivalent  of  50,000  books. 


people  and  can  give  advice 
about  routes  to  take,"  says  Jer- 
ry Michalski  of  Release  10,  an 
industry  newsletter. 

And  users  never  have  to  see 
the  dreaded  "Error  404  — 
page  not  found"  message.  If  a 
Web  page  no  longer  exists, 
Alexa  will  find  a  recent  copy  in 
Kahle's  Internet  Archive  and 
serve  it  up.  Alexa  must  be 
downloaded  to  be  used;  it's  ad- 
supported  and  free  to  users. 

Both  Alexa  and  the  archive 
spring  from  Kahle's  fascina- 
tion with  libraries  as  founts  of 
information.  His  interest  dates 
to  the  late  1980s,  when  he  de- 
veloped the  Wide  Area  Infor- 
mation Server  (WAIS),  a  pio- 
neering Net  publishing  system. 

Although  HTML  won  out  as 
the  primary  publishing  format, 
the  method  Kahle  devised  for 
indexing  the  Net  became  one 
of  the  most  popular  lookup 
tools  of  Ihe  time,  so  popular 


that  in  199,"),  America  Online 
paid  $15  million  for  it.  That 
gave  Kahle  the  funds  to  pursue 
his  dreams,  one  of  which  was 
preserving  Ihe  digital  past. 

No  one  who  knows  Kahle  is 
surprised  that  when  he  decid- 
ed there  should  be  an  archive 
of  the  Internet,  he  just  sat  down 
and  made  one.  He  tends  to  pur- 
sue his  passions  wholehearted- 
ly, whether  they're  technical 
problems  to  tDe  solved  or  social 
networks  to  be  knit. 

Take  the  Thursday-night  pot- 
luck  dinners  Kahle  and  his 
wife,  Mary,  have  held  for  the 
past  10  years.  F^ch  includes  a 
question.  "What's  the  most  in 
teresling  game  you've  ever 
played?"  or  "What's  the  strang- 
est place  you've  ever  slept'" 
Every  guest  is  obliged  to  an- 
swer in  the  form  of  a  story. 

Kahle,  who  graduated  from 
MIT  with  a  degree  In  artificial 
intelligence  and  F^slern  reli 
gions,  delights  in  inviting  new- 
comers to  these  meals,  the  cou- 
ple's answer  to  the  problem  of 
meeting  interesting  people 
once  they'd  left  college 

It's  his  engineers  "let's 
make  something  to  fix  this 
problem"  attitude  that  has  got- 
ten Kahle  so  far.  But  some- 
times his  enthusiasm  for  ideas 
causes  him  to  overlook  pracli- 
cal  considerations,  observes 
one  of  the  fathers  of  the  Inter- 
net, Vinton  Cerf,  now  al  MCI 

Kahle  has  sidestepped  the 
looming  issues  of  copyright 
and  privacy  raised  by  copying 
Web  pages  without  their  cre- 
ators' express  permission.  You 
can  always  take  a  page  down, 
but  if  it's  in  the  archive,  anyonf 
can  still  access  it.  Special  cod- 
ing can  be  included  to  prevent 
a  page  from  being  archived  or 
indexed,  but  many  people  don't 
know  this. 

Kahle  says  that  by  not  wor- 
rying about  the  details  he's 
able  to  do  things  others  think 
are  impossible. 

And  that  alone  is  enough  to 
make  Cerf,  someone  who 
knows  a  thing  or  two  about  big 
plans,  respect  Kahle:  "He's 
contributed  more  than  his  fair 
share  of  interesting  and  inno- 
vative ideas." 

Says  Cerf,  "I  think  Brewster 
is  the  kind  of  visionary  who 
bears  watching." 


You  are  a  database  and  access  abounds 
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You  are  a  database  and  access  abounds 

Today's  typical  Net  surfer  can  get  all  kinds  of  information  that 
would  have  taken  a  private  eye  to  turn  up  in  the  past,  says  Tom 
Sterner  of  San  Francisco. 

He  ought  to  know.  Sterner  is  an  investigator  with  Decision 
Strategies  International,  and  he  isn't  at  all  surprised  at  heightened 
concern  about  privacy  in  the  age  of  the  Internet.  "Generally  people 
freak  out  about  people  knowing  stuff  about  them,"  he  says. 

Sterner  and  most  investigators  use  specialized  databases,  whose 
"prohibitive"  expense  keeps  them  from  the  reach  of  the  layman. 

But  that  doesn't  mean  you  can't  find  a  lot  of  the  same  material 

elsewhere. 

"If  information  exists  in  one  place,  it  exists  in  more  than  one 
place,"  says  database  expert  Carole  A.  Lane,  who  will  testify 
Tuesday  at  Federal  Trade  Commission  hearings  about  privacy 
online. 

She  calls  that  rule  "Lane's  First  Law  of  Information."  And  it  has  a 
corollary:  "If  information  exists  anywhere,  no  matter  how 
carefully  guarded,  it  exists  somewhere  else,  where  virtually 
anyone  can  gain  access  to  it." 

Lane  makes  her  point  in  a  new  book  titled  Naked  in  Cyberspace: 
How  to  Find  Personal  Information  Online  (Pemberton  Press, 
$29.95). 

The  book  has  more  than  500  pages  detailing  how  information 
professionals  dig  up  data,  and  where  they  turn  when  they  hit  a 
dead  end.  It  holds  a  few  privacy  lessons  too. 

In  addition  to  databases  with  identifying  information  and  public 
records  from  criminal  convictions  to  fishing  licenses.  Lane  cites 
sources  even  private  investigators  might  not  think  to  look  at,  such 
as  alumni  newsletters  posted  online  (for  news  about  marriages, 
births,  job  changes)  and  even  genealogical  bulletin  boards. 


9/18/97  11:02  AM 


You  are  a  database  and  access  abounds  http://167.8.29.8/plweb-cgi/idoc.p...E+USATODAY_ONLINE+NEWS-NHWS+-alex: 


Jobs  at  USA 


births,  job  changes)  and  even  genealogical  bulletin  boards. 


TODAY  "Genealogists  are  very  generous  about  sharing  data,  and 

oftentimes  it  includes  who  the  family  members  are  now,  and  where 
they  are,"  she  says. 

Genealogy  may  sound  innocuous,  but  it  could  expose  information 
that  compromises  privacy:  mother's  maiden  name,  for  example,  is 
often  a  key  to  sensitive  financial  data. 

"I  don't  remember  ever  seeing  mothers'  maiden  names  in  any  site 
I've  seen,"  Sterner  says.  "I  would  consider  that  to  be  the  equivalent 
of  a  PIN  for  information." 

Among  other  personal  information  Lane  cites  online: 

•  Phone  directories.  Many  are  free  on  the  Web;  some  have 
enhanced  features  that  let  you  type  in  a  phone  number  to  get 
name  and  address  or  print  out  a  map  of  the  neighborhood. 
Some  even  include  unlisted  phone  numbers.  While  these  are 
not  given  out  by  phone  companies,  they  can  be  gleaned  from 
other  sources,  such  as  information  fi-om  product  warranty 
cards,  which  is  often  sold  to  direct  mailers. 

•  Marketing  lists.  Companies  on  the  Web  advertise  all  flavors 
of  lists  that  allow  advertisers  to  target  specific  types  of 
consumers,  from  occupations  (Database  America  touts  250) 
to  people  who  have  moved  recently,  are  "upscale 
grandparents"  or  own  pets.  "Even  pharmacies  sell  databases, 
which  shocked  me,"  Lane  says. 

•  Internet  discussion  logs.  Web  sites  including  Deja  News 
(http://www.dejanews.com)  keep  searchable  records  of 
ongoing  conversations  on  thousands  of  topics,  from  TV 
shows  to  business  to  quilting. 

A  student  might  post  uncensored  anecdotes  or  opinions,  "not 
realizing  that  somebody  years  later,  perhaps  a  potential  employer, 
might  learn  they  had  done  things  they  were  embarrassed  by,"  Lane 
says.  "They  might  never  tell  you  what  they  saw,  but  they  might 
toss  your  resume  before  ever  calling  you." 

It's  now  possible  to  specify  that  a  posting  in  an  Internet  discussion 
or  "newsgroup"  not  be  archived,  but  not  everyone  knows  this,  and 
many  others  just  don't  bother. 

•  Home  pages.  Do-it-yourself  personal  pages  proliferate  on 
the  Web;  tools  to  create  them  and  places  to  publish  them  are 
now  free  with  many  e-mail  accounts.  And  people  post  all 
kinds  of  personal  information  "not  thinking  how  it  could  be 
misused,"  Lane  says. 

Examples  include  everything  from  resumes  with  home  address, 
phone  and  hobbies  to  pictures  of  people's  kids. 

"I  wouldn't  post  my  phone  number  and  address  at  the  local  grocery 
store  bulletin  board,"  Lane  says.  "So  why  would  I  post  it  online  on 
the  whole  world's  bulletin  board?" 
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Home  pages  can  be  changed  or  taken  down,  of  course,  but  even 
now  "snapshots"  are  being  taken  of  what's  on  the  Web  at  any  given 
time. 

Brewster  Kahle,  known  for  designing  the  first  Net  pubHshing 
system,  WAIS,  is  now  creating  an  Internet  Archive 
(http://www.archive.org). 

His  digital  robots  crawl  the  Web  making  copies  of  every  page  so 
scholars  and  historians  in  the  future  will  be  able  to  study  the  early 
days  of  the  Net  in  the  same  way  they  now  use  letters  and  diaries  to 
study  the  past. 

"The  wonder  of  the  Internet  is  it's  changing  rapidly,"  Kahle  says. 
With  the  archive,  "you  can  dial  the  time  you  want"  and  see  what 
was  there. 

People  seeking  specific  personal  information  wouldn't  want  to 
download  the  massive  archive  files,  but  Kahle  plans  this  summer 
to  launch  a  library-type  service  called  Alexa,  which  will 
automatically  direct  browsing  software  to  grab  a  recent  copy  of  a 
page  if  it  can't  be  accessed  directly. 

His  goal  is  simply  to  make  the  Web  "more  reliable  and  less  flaky," 
but  Kahle  obviously  sees  privacy  implications. 

So  far,  though,  he  says,  "The  spooky  aspects  haven't  cropped  up." 

And  if  anybody  doesn't  want  his  page  archived,  Kahle  doesn't 
argue.  "We  take  it  off" 

By  Leslie  Miller,  USA  TODAY 
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Alexa  1.0 

Freeware 


Alexa  is  a  web  navigation  service  that  helps  you  surf  the  web  smarter, 
faster,  and  easier.  The  sen/ice  is  displayed  as  a  toolbar  that  works  with 
your  web  browser  (Internet  Explorer  or  Netscape)  to  provide  a  continuous 
source  of  relevant  recommendations  of  where  to  go  next  on  the  web. 

With  Alexa,  you  learn  more  about  each  site  you  visit,  including  who  owns 
it,  how  many  pages  it  has.  how  frequently  it  is  updated,  and  its  popularity 
among  other  Alexa  users.  Also,  there  is  a  list  of  links  corresponding  to 
each  site  that  represent  places  you  probably  would  like  to  visit.  Alexa  also 
has  support  for  instant  messages  and  searching  Encyclopedia  Britannica 
Online  and  Merriam-Webster's  Dictionary  and  Thesaurus. 
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Publisher:  Alexa  Internet  Vision 


Download  32-bit  version  (1 .82M) 


Platforms:  Win  95.  Win  NT  4.0 
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Figure  2.  InterNex's  Network 


We  operate  with  two  licenses  of  Oracle  7.3.3.  The  databases  are  synchronized,  but  independent.  Again,  if 
one  system  is  rendered  unavailable,  the  second  system  will  take  over  automatically. 

TRUSTe  runs  three  redundant  Apache  Web  servers,  but  will  soon  move  to  Oracle's  Application  Server  and 
implement  Oracle's  Payment  Cartridge. 


Alexa  &  WorldPages  'Point  to'  TRUSTe  Licensees 

Alexa  Internet,  a  web  navigation  service,  is  working  with  TRUSTe  on  a  service  that  will  readily  identify 
TRUSTe  licensees  to  web  users.  Alexa  works  parallel  to,  but  independently  of,  web  browsers  and  is 
displayed  as  a  toolbar  on  the  bottom  of  the  user's  screen.  In  a  nutshell,  the  bar  knows  where  the  user's 
browser  is,  describes  "where  you  are",  and  provides  input  about  "where  to  go  next."  A  "where  you  are" 
pop-up  box  displays  meta  data  for  the  site — including  the  TRUSTe  logo  of  licensees. 

Just  as  a  recognizable  trustmark  can  stimulate  commerce,  we  believe  TRUSTe  can  boost  traffic  to  member 
web  sites.  How?  Through  an  authorized  directory — the  "safe"  place  to  browse.  WorldPages.  a  leader  in 
Internet  database  look-up  services,  is  working  with  us  to  develop  the  directory.  WorldPages  will  contribute 
web  page  design,  a  "million  banner  impressions"  to  drive  traffic  to  the  directory,  promotion  to  all  major 
search  engines  and  sites,  a  "link  to  us"  program,  and  home  page  links  to  TRUSTe.org.  All  TRUSTe 
licensees  will  be  featured  free  via  a  searchable,  alphabetical  categorization  system. 
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THIS  ISSUE 


73.  It  is  not  a  language's  weakness  but  its 

strengths  that  control  the  gradient  of  its  change: 

Alas,  a  language  never  escapes  its  embryonic  sac. 

—  Alan  J.  Perils,  "Epigrams  in  Programming", 

ACM  SIGPLAN,  September,  1982. 
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AGENTNEWS 

Agents'98 


.\uents'98.  the  Second  International  Conference  on  Autonomous 
.\gents,  will  be  held  Sunday  May  10-Wednesday  May  13,  1998 
in  Minneapolis/St  Paul.  Agents'98  is  sponsored  by 
ACM/SIGART  and  in  cooperation  with  AAAI.  Papers  are  due 
on  October  1,  1997  with  instructions  available  at 
I  .ip://www. aaai.org/Conferences/AA98ss/submit.html  .  For 
information  about  proposing  tutorials,  workshops, 
cicmonstrations  and  exhibits  see  the  Agents'98  web  page.  Send 
L  mail  to  agents-98-subscribe@cs.umbc.edu  to  be  automatically 
.;  J.ded  to  a  mailing  list  for  occasional  news  about  the  conference. 


Positions  available 


Now  agent-related  positions  were  posted  by  USC/ISI.  The 

■  :iiversity  of  Calgary.  Department  of  Mechanical  Engineering. 

Kmetic.  Charles  River  Analytics.  FCI  Communications. 

2:  lanchester  Metropolitan  University  .  ZUNO  (UK),  and  IS 

Kobotics 


AGENT TECHNOLOCY 


Jess  3.0 


!  inest  Friedman-Hill  of  Sandia  has  released  Jess  3.0.  This  is  a 
i'nal  release  of  Jess,  a  clone  of  the  popular  CLIPS  expert  system 
.-ricll  written  entirely  in  Java.  Jess  3.0  adds  significant  new 
:  jatures  like  multifields,  multislots,  and  incremental  reset, 
1  icludes  a  number  of  bug  fixes,  a  more  flexible  user  function 
incrface,  *lots*  of  new  built-in  functions,  and  a  general  code 

canup  and  reorganization  (a  'jess'  package.)  Jess  3.0  is 
t  Miipatible  with  all  versions  of  Java  starting  with  version  1.0.2. 
:    is  (in  particular)  Java  1.1.3  compatible. 
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Internet  (founded  in  April  1996  by  Brewster  Kahle  and 


Bruce  Gilliat  describes  itself  as  "The  company  gathers,  manages 
and  analyzes  multi-terabyte  collections  of  information  to  make 
information  access  and  navigation  easy  for  everyone."  Their  first 
product  is  Alexa  —  a  client  program  (as  opposed  to  a  plug-in  or 
proxy  server)  that  has  two  components  ~  the  toolbar  and  the 
contact  system.  As  you  browse  the  Web,  the  Alexa  toolbar 
contacts  their  servers  independently  and  separate  from  your  Web 
brow  ser.  Alexa  provides  the  following  services: 

.   Where  Am  I?  ~  Alexa  provides  context  for  a  site  visited 
by  offering  instant  background  information  (e.g.,  number 
of  pages,  frequency  of  updates,  number  of  links  into  the 
site,  popularity,  etc.)  so  you  can  decide  if  it  is  worthwhile. 

.   What  happened  to  my  page?  —  Alexa  gives  you  automatic 
access  to  unavailable  web  pages  through  their  archive, 
eliminating  the  problem  of  "404  Not  Found"  messages. 

.  Where  should  I  go  next?  —  Wherever  you  travel  online, 
Alexa  is  there  with  suggestions  hot-liiiked  to  places  you 
might  want  to  visit  next. 

.   Who  else  is  online?  -  Alexa  tells  you  instantly  which  of 
your  fnends  and  colleagues  are  currently  online,  and  lets 
yoLi  send  an  instant  message  so  you  can  communicate  with 
them  in  real-time. 

The  first  two  services  are  provided  using  their  own  archive  of  the 
Web  which,  as  of  July  1997,  is  in  excess  of  five  Terabytes. 
Alexa  began  collecting  the  Web  in  early  1996  and  now  has  at 
least  one  snapshot  of  over  500K  web  sites.  They  adjust  site  visits 
to  retlect  their  apparent  speed  of  change  and  estimate  that  they 
ha\  e  a  new  snapshot  approximately  every  60  days.  The  browsing 
suggestions  are  computing  using  collaborative  filtering  over  the 
database  of  paths  taken  by  Alexa  users  (with  suitable  privacy 
safc;juards). 

Search  Engine  Watch  is  a  nice  collection  of  information  on  web 
search  engines.  Creator  and  maintainer  Danny  Sullivan,  of 
Caiafia  Consulting,  say  "It  contains  all  that  "A  Webmaster's 
Guide  to  Search  Engines"  had,  plus  it  is  more  accessible  to 
search  engine  users.  The  site  remains  grounded  in  the  technical. 
How  do  these  things  work,  or  more  specifically,  how  well  do 
these  things  work.  It  also  remains  committed  to  tracking  search 
engine  news,  important  changes,  and  providing  quality 
information  about  these  important  tools.  The  information  from 
continuing  studies,  search  engine  help  pages,  articles,  reviews, 
books,  tips  from  others,  feedback  received  directly  from  the 
\  anous  search  engines,  and  interviews  I  conduct  in  the  course  of 
co\  cring  the  search  engine  beat."  They  also  run  a  maihng  list 
de\  oted  to  search  engines. 
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Silicon  start-ups  on  the 
silver  screen 


Sept.  8,  1997 
By  Chris  Nolan 

Mercury  News  Staff  Writer 


SO  you  think  ya  oughta  be  in  pictures,  eh? 

Well,  you're  not  alone.  Start-up  veterans  Eric  Swildens  and  Jay  Capela  are 

making  a  movie  about  their  --  and  maybe  even  your  --  lives.  But  unlike  many 
of  those  who  envision  immortality  in  pictures,  Swildens,  the  executive 
producer,  and  Capela,  the  producer  and  director,  have  the  money  and 
experience  to  make  this  film. 

The  film's  workmg  title  is  WiId@Start  -  but  this  is  no  vaguely  hallucinatory 
David  Lynch  tour  of  America.  No,  it's  going  to  be  a  real-life,  warp-speed  tour 
of  life  in  a  Silicon  Valley  start-up.  It's  going  to  be  a  documentary  about  real 
companies  and  real  people.  In  fact,  said  Swildens  and  Capela,  making  a  movie 
is  a  lot  like  starting  a  company  (To  make  the  comparison  complete,  there's  a 
Web  site. 

"There's  a  lot  of  excitement  and  terror,"  said  Swildens.  "All  the  time,  it's 
'We're  doomed'  or  'I'm  the  richest  person  in  the  world.' "  Swildens,  30,  isn't 
the  richest  person  in  the  world  or  even  the  valley.  He'll  cautiously  admit  to 
being  a  millionaire  as  a  result  of  selling  his  start-up,  Microline  Software,  to 
Neuron  Data.  "The  highs  and  the  lows,"  he  said,  "were  really  high  and  low." 

Director  Capela,  a  film  school  veteran  who  worked  for  a  while  at  Swell 
Software,  a  start-up  since  sold  to  Adobe  Systems,  said  he's  looking  for  the 
right  balance  of  start-ups  to  feature  in  the  movie.  Capela  said  he's  talked  with 
a  couple  of  high-profile  companies  like  Alexa  Internet,  EBay  and 
Crossroute.  And  the  movie  hopes  to  depict  all  the  players  in  the  start-up 
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process  -  from  venture  capitalists  to  lawyers  to  established  companies.  Oh, 
and  just  like  a  start-up,  Swildens  and  Capela  are  looking  for  investors,  too. 

Wild@ Start  is  scheduled  to  start  filming  --  this  is  intended  to  be  a  theatrical 
release  --  in  about  four  weeks,  said  Capela. 

WHERE  DO  YOU  WANT  TO  GO  TODAY?  They  had  a  contest  to  rename 
cue  International  --  the  software  conglomerate  that  earlier  this  year 
purchased  Berkeley  Systems  and  Books  That  Work  --  but  no  one  won. 

It's  too  bad,  too,  because  the  winner  would  have  gotten  dinner  for  two, 
anywhere  --  and  they  meant  anywhere  --  in  the  world.  The  contest  was  open  to 
the  15,000  employees  of  CUC,  which  is  based  in  Stamford,  Conn.,  although 
Books  That  Work  and  Berkeley  are  Bay  Area  companies.  CUC  got  some 
10,000  entries,  according  to  spokeswoman  Sue  Tobin.  ''There  were  some 
really  great  names.  We'd  narrow  it  down  to  a  few,  but  they'd  be  taken,"  she 
said. 

The  company  couldn't  take  a  name  that's  already  registered  with  the  U.S. 
Patent  and  Trademark  Office.  Names  that  were  spelled  different  but  sounded 
alike  were  out,  too.  Finally  CUC  jettisoned  all  the  submissions,  and  resorted 

to  an  identity  company. 

There  were  rewards  for  good-faith  efforts,  though.  Runners-up  got  American 
Express  gift  certificates  worth  $150  to  use  for  dinner,  said  Tobin.  They  just 
had  to  pay  their  own  transportation  to  and  from. 

And  cue's  new  name?  It's  Cendant.  The  change  is  official  later  this  month. 

IMAGE  OR  REALITY:  Two  Silicon  Valley  companies  have  taken  to  television 
to  sell  their  wares.  But  this  isn't  the  usual  pre-Christmas,  back-to-school 
computer  sales  rush. 

Instead,  the  ads  from  Cybercash  and  CrossRoads  Software  push  products 
the  average  household  will  never  want  or  need:  digital  commerce  technology 
from  Cybercash  and  integration  software  from  CrossRoads. 

The  ads  are  running  on  networks  with  a  rather  particular  audience  --  CNN, 
CNN/fn  and  CNBC,  whose  up-to-the  minute  breaking  news  and  stock  tickers 
serve  as  the  background  video  of  preference  for  brokerages,  investment  banks 
and,  we  confess,  newsrooms.  These  are  the  kinds  of  places  that  can 
dramatically  affect  a  company's  stock  price. 

~  The  market  is  largely  investors  or  brokers  and  people  who  are  connected  to 
the  financial  community,"  said  Brian  Rolfe,  a  spokesman  for  Cybercash.  He 
said  the  ads  are  also  targeted  at  potential  customers  -  CEOs,  CFOs  and  CIOs. 
Tt's  both.  It's  absolutely  both,"  he  said.  ''Who's  going  to  get  it  the  most,  I'm 
curious  to  find  out." 

Not  surprisingly,  there's  an  agenda  here:  CrossRoads  wants  to  go  public.  And 
Cybercash  -  called  Cybercrash  by  some  investors  because  the  pnce  has  fallen 
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SO  far,  so  fast  --  needs  help  getting  its  stock  back  to  something  like  the 
$40.50-a-share  high  of  a  year  ago.  It's  now  trading  at  about  $18  a  share. 

"You're  right,  it's  not  a  very  typical  path,"  said  CrossRoads  CEO  Katrina 
Garnett.  But  it's  not  expensive  and  it  might  work,  she  said.  "I'd  rather  do  it 
this  way,  than  spend  $100,000  on  a  page  in  the  Journal  for  one  day." 

Got  a  tip?  A  story?  A  good  party?  Pick  up  and  phone  and  dial  or  log  on  and 
start  typing.  You  can  reach  me  at  the  San  Jose  Mercury  News,  750  Ridder 
Park  Drive,  San  Jose,  Calif.,  95190;  408)920-5490;  or  e-mail 
C Nolan  @sjmercury.com.  Talk  is  Cheap  appears  Mondays  and  Thursdays. 

Posted  at  6:51  p.m.  PDT Sunday,  September  7,  1997 


I  Mercury  Center  Home  I  Index  I  News  Library  I  Feedback.  I  ©1996-7  Mercury  Center.  The  information  you  receive  on-line  from 
Mercury  Center  is  protected  by  the  copyright  laws  of  the  United  States.  The  copyright  laws  prohibit  any  copying,  redistributmg, 
retransmitting,  or  repurposmg  of  any  copyright-protected  material. 
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WINDOW  MANAOER  •  BRIAN  LIVINGSTON 


Alexa  provides 
additional  ways  to 
navigate  tlie  Web 


low  p: 


Nmv  a  new,  tree 
'..lu  vital  informain 
ular  a  Web  site  is  ar.J  how  m, 
people  liked  and  disliked  th 
iiteraetions  with  ;•.  E'/en  bett^ 
you  can  jump  to  related  sites  without 
having  to  resort  to  a  key-Avord  search,  and 


I 


vou  can  retrieve  old' 


pa^es  that  have  dis- 


iNDOws  use;rs  are  increasinglvex- 
'ploring  the  World  Wide  Web/Pewer 
than  20  percent  of  Windows  PCs  don't 
have  Internet  access,  according  to  a  re- 
rent  Microsoft  press  brieting.  But  look- 
ing at  a  Web  site  and  having  confidence  that 
the  site  is  legitimate  and  trustworth'.  are  twn 
different  things. 


appeared  from  the  Net. 
Alexa  is  a  relatively  small  free- 
ware download  (918KB  in  sue), 
that  works  with  Netscape  Nv- 
igator  3.0  or  later  and  Interne; 
Explorer  3.0  or  later  on  Windows  95 
and  Windows  NT.  (If  you  are  using  Windows 
95,  vou  need  a  version  of  the  file  Kernel32.dll 
dated  later  than  Feb.  2,  1996.  Vou  can  di  .vn- 
load  the  updated  version  from  Ale.xa's  site.) 


When  you  install  Alexa,  a  small  toolbar 
appears  at  the  bottom  of  your  screen  when 
your  browser  is  active.  While  you're  viewing  a 
Web  site,  clicking  the  Where  Am  I  button  on 
Alexa's  toolbar  reveals  a  wealth  of  information. 

Leading  the  display  is  a  ranking  of  the  num- 
ber of  hits  the  site  receives  based  on  an  analy- 
sis of  traffic  across  the  Internet  backbone. 

Even  more  useful  is  the  name,  address,  and 
phone  number  of  the  site's  owner. 

This  information  is  retrieved  from  the 
domain  registrar  InterNIC  and  includes  the 
length  of  time  the  domain  has  been  in  exis- 
tence. Because  longevity  alone  is  not  a  good 
indicator  of  an  Internet  business'  reliability, 
,\lexa  collects  votes  from  its  users  on  whether 
they  liked  a  site.  Unfortunately,  you  can't 
learn  why  visitors  voted  as  they  did.  But  in 
one  case,  for  e.xample,  a  particular  travel  ser- 
vice developed  a  large  number  of  "dislike" 
votes  —  perhaps  a  hint  that  you  should  con- 
sider a  different  service  for  \-our  travel  needs. 

'  )ne  of  the  most  remarkable  features  of 
.Uexa  is  that  it  can  retrieve  old  Web  pages  that 
have  moved  or  changed.  When  you  see  the 
message  "404  Not  Found"  in  your  browser, 
you  click  the  Archive  of  the  Web  button  and 
Ale.xa  will  attempt  to  display  the  most  recent 
archived  version  of  that  Web  page. 

This  feature  grew  out  of  the  Internet 
Archive  project,  an  effort  to  preserve  Web 
pages  for  posterity.  Officials  at  Alexa's  parent 
company,  San  Francisco-based  Alexa  Inter- 
net, say  it  holds  three  different  "snapshots"  ot 
more  than  500,000  Web  sites  that  have  been 
collected  since  early  1996.  This  information 
occupies  8  terabytes  (8,000GB)  of  storage  — 
equivalent  to  all  of  the  data  in  all  of  the  tapes 
in  a  typical  video  store. 


Alexa  uses  the  links  between  sites,  as  weii 
as  Its  analysis  of  how  users  jump  from  site  to 
site,  to  determine  other  sites  that  may  be  rel- 
e\ant  to  the  site  you're  currently  viewing. 
Clicking  Alexa's  Where  to  Go  Next  button 
lists  these  suggestions. The  links  aren't  always 
pertinent,  but  they're  a  start.  Alexa  supports 
itself  with  small  advertisements  in  this  box. 

You  don't  even  need  Alexa  to  use  some 
ot  its  data.  For  example,  set  your  browser  to 
http://widener.alexa.com/sitedata/yahoo 
.com  to  see  contact  and  rating  information 
on  the  Yahoo  search  engine  site.  Change 
yahoo.com  to  any  site  vou  wish. 

But  the  d(-)wnload  is  well  worth  it.  Go  to 
http://www.alexa.com/download.  I  would 
like  to  thank  Danny  Sullivan  at  http:  search 
engine.com  for  this  suggestion. 


BiiiUi  Livin;^iton  n  the  co-author  ofievcral 
best-selling  Windows  books,  including  the 
most  recent  Windows  95  Secrets  (IDG 
Books).  Send  tips  to  brian_livingston@ 
tntoworld.com.  He  regrets  that  he  cannot 
an>wer  individual  questions. 
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dey-elops  new  way  to  search  Web 


r-.v  It  ^'yc?-:(S 


a  C: 


•2' or  0-1  s.^ip- 


?5  i  .*r>'^crr  'cr  2  conventional 
;e:-cn  a-qne  .v  ■!  r-od'^c9  nun- 
cre-Jo  cf  ;r-c->.s£>rc3  of  sites.  :r»c:ud- 
rg  J-S  3  A.e-a  s  ;ec--^c'ogy, 
s;?'..ng  from  ra  J'^S  3ite.  will  look 

•  S  ■^I'-f.'  .vc'cs  ard  ^^rases  at 

t'h?-  3:s3,  ?or  i-star.c3  Aiaxa 
-  y  [  'i=G    rsi.  ers  ir'e  oackage  ' 

-;•  ,ir'ta-  yO'jr  •r3C<i'"g  nufr.oer" 
"-r-  ■'e  'J'^S  5'te  arc  d'reci  you 
0  -ea  E  s  s  ta.  -vr^'r'i  also  con- 
•a.'s  ■D'"-cs53  :.csc  :c  -nat 

•  0:"'a'  ^^9~  3.;si  ;ra;  •"■ava 
■•.-;.9a'  '0  T9  'JOS  v'.e  ard 
".Tl-ce  ntormaiicn  on  ether  rele- 

•  "rartic  o?n9r-is  -i.'e:;?.  tracKs 
wne'O  usars  .vara  ze^ore  amving 
a'  UPS  and  /^nere  ihey  werit  after- 
wa.-'i.  integra'jng  these  panems 
■n'.c  'rs  guioarce  s>;tsm  it  Federal 
E-.::re55  .vore  the  ~ios;  cnrnmcn 
3i;?  wne^e  iSe^s  -v^rt  ng-t.  it 

AC  jic  'ikaiy  ^ccsar  25  a  suggest- 

St  3iI9  :0  TC-.3  'Z 

-iMsf  orcc333.rg  tf.e  aCcve  infor- 
-T-ation,  Ai9:'a  suggests  iO  3'tes. 
='on  the  U'^S  i.t?  -.6.-3  'Qcentiy 
STsered  jse'S  'o 

•FedE;;. 

•  DHLV''o-:d".vica  Excrass 
•U  «=   =cs:r'  ==— 'c- 

•  5-1-  ^-jr-.:"    '.V3?n    TT.Fis- 
:,c-:;ic^  Carprrrqr-; 

•,  ca.-^  ;;ad::cvc   ? 'Tutch  cor- 
•  =  i-a-  «  vrjjrig  corrc^-.ry 

jc:$  ?.'  for  e?cn  siwC',3;;sc  j.te. 

31:9  :p?  scees  0'  '3  con^cutars. 
•r:  -I.  -rcer  oi  Oig?s  n  r-ia  sita 
art  2  T.-rg  Tor^  c"h3r  Aiexa 


9y  RAJIV  CHANDRAoEKARAN 


San  FSaNCISCO  —  Foi  niorff  :hjn  i  -far  j  tcjni  or 
■r^puter  indUilf"-  snLrrpreneuri  ha.^  been  j-,in;.'  1  rcointui 
I  .■ornpute:>  '0  ^i\c<  ev<r\  World  Wid*  Wrb  ?ii,'tf  'hev  .an 


I  he:.-  nt..5MVf  (Ji^itj!  library  on^-injlly  wj-i  '  K^vtd  js  4 
-tu;->.  '•enture  ic  isstfmble  an  Iiiteir.ei  nn:e  capvjie  et 
inipiitef  netwoik  s  eaxly  days 


the  ^lubal  < 


3ut  tlii  tl<i.'.run;c  Iibrar.aiu  appeax  10  have  tour.d  j  n.ore 
i-nmsdiate  —  and  pytentially  protjiabic  —  jje  tor  their  2 
lnl!iun--.^irjc;cr  pile  of  djt.i  a  new  and  rrrijise  *ay  tor 
ordinary  con-.p^it;:  -jicrj  to  naviyatf  the  (prl'vimg  'Wr«b 

To  t'lr.d  •.vnii  ihey  :e  looking  for  on  ;lie  'Aeb.  moit  ;om- 
pj;er  jjers  plu^  'fcevwordJ  into  sorcrj  tniincs  —  \^eb 
Mtei.  ^it.'i  nanr.ei  jutn  as  Inrojeek  ind  E<ciie  that  look  tor 
otner  W-t;  ,:te5  containing  tfiat  key«orJ  ard  then  generate 
1  li3t  of  siit'j.  That  can  produce  3  long  liii  :o  wade  thfoujh 
and  the  sezic'^  angint!.  of^en  miss  relevant  sues  because  the 
iceywofd  IS  slightly  off  target 

But  the  jntemct  Archive's  service  —  called  Alexa.  after 
the  ancient  library  jt  Ale\andna.  Egypt  —  doesn't  rely  on 
a  word  '<i/c.h 

i'niiead.  A.'<\a.  a  lottvurc  add-on  for  '-Veb  browsers  tha; 
-■.in  be  downloaded  tronr. 
L)t:p.7.v-7\v.-)c"t3.cofn 

j-e<;  >ev?r:,l  power'u!  romputers  to  crunch  through  the 


archived  data  —  a  procsjs  called  daia  min<n^  —  and  '■h. 
tor  3-ttem>  withm  the  Web.  jlf.maieK  providmi  a  fev.  -r 
ommeuded  strands  to  travel 

Frorti  L'mied  Pircel  5<;r^ice  s  Wetj  jiie  tor  in>i:nw. 
Aleta  points  the  u»er  to  >ite\  for  Federal  Express  the  L'  *■ 
Postal  Service  and  DHL  W-brldwide  E^prevs  Siartint  at  -: 
V.cb  piic  or  cai.'.i3<  :!oihui«  retailer  Lands  End  iitej  ;■ 
I  Creiv   L  L   Bean  and  Victoria's  Secret  axe  suygeited 

Sometimes  ;t  j  jpookv  how  helpful  it  can  be      -ii 
Brewster  Kahle  the  irchne  s  founder  and  the  prejiden; 
AIe'<a  InttftTiet  Inc    a  San  F.-aactsco  >tan-up  .lompanv  -r 
intenCs  10  offer  the  new  navuai-.on  service  for  free  10  ^^c 
kne-i 

Thii  .>  not  the  r'^rst  nme  someone  has  claimed  to  n.  ■ 
cr3c.<ed  the  code  .'at  iialan*  the  '^t^  easily  seirch;b.  • 
A.id  ;t  >  not  clea;  whether  Ale t a.  which  is  beinz  te.^ied  ; 
about  ;0  ceo  people  now.  would  function  well  if  mjl.'ic 
of  pecplc  :ned  to  use  it.  But  in  m  early  form  it  s  %:::- . 
generally  jpbcat  reviews  la  the  computer  mduitt^' 

'Sej/t:h  engines  can  drown  you  in  documents.  '  -;■ . 
Jerry  Michaislci.  manafing  editor  of  Release   1.0.  a  ."•< 
Yorn-based  computer  industry  newjiencr. 

Aikjni  the  Infoseek  engine  to  find  sites  *ith  the  We- 
wofO       shipping,'    for    example     generates    nnore    it\^-i 
29;.0CO  citations  with  sues  for  a  Bntish  per.  and  an  ere  ■ 
ic  bird  store  Jt  tte  ;op  of  ;he  list. 

'Ale'.a  is  a  big  step  forward."  Michalslcj  said.  "It  i  li><! 
hiving  a  huge  associanve  memory  that's  looked  at  moii  •  : 
the  Lr.teme;   ' 
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Navigate  Internet       ^^ 
to  track  packages      .*•- 

.  If  you  want  information  on^*' 
shipping  a  package,  usin^ 
"shipping"  as  a  keyword  for  a 
conventional  search  engine  will 
produce  hundreds  of  thousands 
of  sites,  including  UPS's.  Al; 
exa  i  technology,  starting  from 
the  UPS  site,  will  look  for. 

■  Similar  words  and  phrases 
at    other    sites.    For    instance. 
Ale.xa  mjght  flag  "delivers  the    • 
package"     or     "enter     your 
tracking    number"     from    the    ■' 
UPS  site  and  direct  you  to  Fed     - 
Ex$   site,   which   also  contains 
phrases  close  to  that. 

■  Other  Web  sites  that  have 
"linked"  to  the  UPS  site  and 
include  information  on  other 
relevant  home  pages 

■  Traffic    patterns      Aiexa 
tracks  where  users  were  before 
arriving    at    UPS    and    where 
they     went     afterward,     inte-    '* 
grating  these  patterns  into  ila 
g^Iidance    system.     If    Federal  ~l 
Express      were  _  the      most      ' 
common  site  wh^'t  users  went 
next,  it  would  likely  appear  as      1 
a  suggested  site  to  move  to      //     | 
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Internet 
searches 
improve 

Alexa  doesn't  rely 
on  words  alone 

By  RAJIV  CHANDRASEKARAN 

The  WaSMngTor  Post . 

SAN  FR.ANCISCO  -  For  more 
ihar.  a  year,  a  team  of  computer 
ind'jstry  entrepreneurs  has  been 
using  a  roomful  of  computers  to 
copy  every  World  Wide  Web  page 
•hev  can  find. 

Their  massive  digital  library 
originally  was  viewed  as  a  quixotic 
venture  to  assemble  an  Internet 
t:me  capsule  ot"  the  global  computer 
nerwork'3  early  days 

But  the  electronic  librarians 
appear  to  have  found  a  more  imme- 
diate -  and  potentially  profitable  - 
use  for  their  2  triilion-character  pile 
of  data;  a  new  and  precise  way  for 
ordinary  computer  users  to  navi- 
gate the  sprawlmg  Web 

To  find  wfta:  they're  looking  for 
on  the  Web,  most  computer  users 
plug  "keywords'*  into  search 
engines  -  Web  sites,  with  names 
such  as  In^oseek  and  Excite,  that 
look  for  o'ther  Web  sites  containing 
that  keyword  and  then  generate  a 
list  of  sites  That  can  produce  a 
long  list  to  wade  through,  and  the 
search  engines  often  miss  relevant 
sites  because  the  keyword  is  slight- 
Iv  off  target . 


But  the  Internet  Archive's  ser- 
vice -  called  Alexa.  after  the 
ancient  library  at  Alexandria. 
Egypt  -  doesn't  rely  on  a  word 
search.  Instead,  AJexa,  a  software 
add-on  for  Web  browsers  that  can 
be  downloaded  from 

http  //ww%v.alexa.com,  uses  several 
powerful  computers  to  crunch 
through  the  archived  datn  -  a 
process  called  data  mining  -  and 
look  for  patterns  within  the  Web, 
ultimately  providing  a  few  recom- 
mended strands  to  travel. 

From  United  Parcel  Service's 
Web  site,  for  instance,  AJexa  points 
the  user  to  sites  for  Federal 
Express,  the  U.S.  Postal  Service 
and  DHL  Worldwide  Express. 
Surting  at  the  Web  page  of  catalog 
clothing  retailer  Lands'  End.  sites 
for  J  Crew,  L.L.  Bean  and  Victo- 
ria's Secret  are  suggested 

••Sometimes  its  spooky  how 
helpful  it  can  be,"  said  Brewster 
Kahle,  the  archive's  founder  and 
the  president  of  AJexa  Internet  Inc.. 
a  San  Francisco  startup  company 
that  intends  to  offer  the  new  navi- 
gation service  for  free  to  Web 
users.  y 
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New  serv  ice  makes  it 
easier  to  surf  the  Web 


,  BY  RAJIV  CHAMOfUSOCaRAN 

I  SAN  FRANXISCO  -  For  more 
thar.  a  year,  a  team  of  computer 
I  industry  entrepreneurs  has  been 
I  using  a  roomful  of  compucers  to 
copy  every  World  Wide  Web  page 
j   the;,  can  find. 

'       Their  massive  digital  library 

originally    was     viewed     as    a 

!  qdixotic  venture  :o  assemble  an 

i    [r.rernet    time    capsule    of    the 

glDba!  computer  nef^ork's  earlv 

days 

;       3ur  the  electronic  librarians 

j   appear  to  have  found  a  mere  im- 

:   media:?  —  and  potennally  prof 

■   'table  —  use  for  their  2  trillion- 

I  character  pile  of  data:  a  new  and 

!  precise  way  for  ordinary  com 

rcter     users     to     navigate     the 

,   sprawling  Web 

I       To  find  wha-  they  re  looking 
:  for  on  the  Web.  most  computer 
users     plug      'Xevword;"     into 
'  search  engines  —  Web  sites,  with 
nam.es  such  as  Lnfoseek.  and  E.x 
cite,  tharlook  for  other  Web  sites 
;   containing    that    keyword    and 
then  generate  a  list  of  sites  That 
I  can  produce  a  long  list  to  wade 
through,  and  the  search  engines 
often  m.tss  relevant  sites  because 
the  keyword  is  slightly  off  target 
But  the  Internet  Archive's  ser 
v::e  —  called  Alexa.  after  the  an- 
cient    library     a:     Alexandria. 
£g-vpt  —  dcesn  t  rely  on  a  word 
search 

Instead.  Alexa,  a  soft'-vare  add- 
on fcr  V.eb  brov.sers  that  can  be 
doAT.loaded  from 

r.r.p  '.vwvv  alexa  com.  uses  sev- 
eral powerful  computers  to 
cniicn  through  the  archived 
la:a  —  a  process  called  da'a  min 
.,',?  —  and  lock  for  patterns 
■.-  :;h;!"  -he  Web.  ultimately  pro- 
'.  :dirig  a  few  recom.mended 
it.  ir.ii  ;■:  tr.v/ei 

From.  L'nrsc  Par-Te'.  Services 
'.'.>b  3:te  fo;-  instance  .A.le\a 
ccir.:s  the  user  :o  >-tes  for  Fed- 


Alexa  analyzes  keyword  traffic 


If  you  want  information  on 
shipping  a  package,  using 
"shipping  as  a  keyword  for  a 
conventional  search  engine 
will  produce  hundreds  of  thou 
sands  of  sites,  including  UPSs 
.Alexa 's  tecrj-.ology.  starting 
from  the  UPS  site,  will  look 
for, 

■  Sim.ilar  words  and 
phrases  at  other  sites  For  in- 
stance. Alexa  might  :lag  "de 
livers  the  package  '  or  "enter 
your  tracking  num.ber"  from 
the  UPS  sue  and  direct  you  to 
Fed  Exs  sue.  which  also  con- 
tains phrases  close  to  that. 

■  Other  Web  sites  that  have 
"linked"  to  -he  UPS  site  and  in 

elude  inform.ation  on  other  rel 
evant  hom.e  pages. 

■  Traffic  patterns  Alexa 
tracks  where  users  were  be- 
fore amvi.ng  at  UPS  and 
where  thev  -A-ent  afterward,  in 


tegratmg  these  patterns  into 
its  g^Jldance  system.  If  Federal 
Express  v\ere  the  most  com- 
mon site  where  users  went 
re\t.  It  would  likel;-  appear  as 
3  suggested  site  to  move  to 

.-M'ter  processing  the  above 
information.  Alexa  suggests  10 
sites  From  the  UPS  site.  Alexa 
recentlv  steered  users  to. 

■  Fed  E.x. 

■  DHL  Worldwide  Express 

■  U.S.  Postal  Service 

■  Bellinghan.  Wash  . 
Transportation  Department 

■  P&O  .Vedlloyd.  a  Dutch 
container  shipping  company 

Alexa  also  provides  a  brief 
dossier  for  each  suggested  site, 
such  as  'Jie  company  running 
t.-.e  site,  the  speed  of  its  com- 
pv.ters  the  number  of  pages  m 
the  sire  and  a  rating  from 
c'her  Ale.xa  users. 


eral  Express,  the  U.S.  Postal  Ser- 
vice and  DHL  Worldwide  Ex 
press.  Starting  at  the  Web  page  of 
catalog  clothing  retailer  Lands' 
End,  sites  for  J  Crew.  L  L  Bean 
and  Victorias  Secret  are  sug 
gested 

"Sometimes  Us  spooky  hcv.- 
helpful  It  c:ir.  be.  '  said  Brewster 
Kahle.  the  archive's  founder  and 
the  president  of  Alexa  Internet 
Inc.  a  San  Franc:sco  startup 
company  that  intends  to  cfTer  the 
Pew  navigation  service  for  free 
to  Web  users 

This  IS  not  the  first  time  some 
one  has  claim.ed  to  have  cracked 
the  code  for  m.aXmg  the  Webeas- 
I'v  searchable  And  it  s  not  clejr 
'"-.-ether  .Alex.i  w.'^.ich  is  being 
'ested  b.v  ab-t-iu  10  000  people 
now.  would  fur.ction  well  vT  mil- 


lions of  people  tried  to  use  it.  But 
m  Its  early  form  it's  getting  gen 
erally  upbeat  reviews  in  the  com- 
puter industry 

Search  engines  can  drown 
you  in  documents.  ■  said  Jerry 
Michalski.  managing  editor  of 
P-^'.-ase  1  0  a  New  York  based 
computer  industry  newsletter 

Asking  the  Infoseek  engine  to 
f.nd  sites  with  the  keyword 
'Shipping  for  example,  gener 
ates  more  than  291.0QO  citations. 
with  sites  for  a  British  port  and 
^r.  exotic  bird  store  at  the  top  of 
the  hs; 

Alexa  is  a  big  step  forward." 
.'^lichalsk;  s.Tid  "It  5  like  having 
•1  huge  associair.  e  memory 
:-UTt  i  rook,.'d  at  most  of  the  inter- 
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Tnlernet  service  mines  mountain  of  data 


TVtE  WASHINGTON  POST 

SAN  FRANCISCO  —  For 
more  than  a  year,  a  team  of 
computer  Industry  entrcpre 
neurs  has  been  using  a  room/ul 
of  computers  to  copy  every 
World  Wide  Web  page  they  can 
find. 

Their  massive  digJtaJ  Ubrar>* 
orlglnaJly  was  viewed  as  a  qutx- 
otJc  venture  to  assemble  an  In 
temet  time  capsule  of  the  global 
computer  network's  early  days. 

But  the  electronic  librarians 
appear  to  have  found  a  more 
Immediate  —  and  potentially 
profitable  —  use  for  their  2  tril- 
lion character  pile  of  data:  a 
new  and  precise  way  for  ordi- 
nary computer  users  to  navigate 
the  sprawling  Web. 

To  find  what  they  re  looking 
for  on  the  Web.  most  computer 
users  plug  "keywords'  into 
search  engines  —  Web  sites, 
with  names  such  as  Infoseek 
and  Excite  —  that  look  tor 
other  Web  sites  containing  that 
keyword  and  then  generate  a 
list  of  sites. 

That  can  produce  a  long  list 
to  wade  through,  and  the  search 
engines  often  miss  relevant  sites 
because  the  keyword  Is  slightly 
off  target. 

But  the  Internet  Archive's 
service  —  called  Alcxa,  after  the 
ancient  ilbrary  at  AJexandiia. 
Egypt  —  docsn  t  rely  on  a  word 
search. 

Instead,  Aiexa,  a  software 
add-on  for  Web  browsers  that 
can  be  downloaded  from  http // 
www.alexa  com.  uses  several 
powerful  computers  to  crunch 
through  the  archived  data  —  a 
process  called  data  raining 
—  and  look  for  patterns  within 
the  Web.  ultimately  providing  a 
few  recommended  strands  to 
travel. 


Sometimes  it's 
spooky  how  helpful  it 
can  be. 

Brewster  Kahle, 

pfwident  Alexa  lntem«(  Inc. 


??- 

From  United  Parcel  Service's 
Web  site,  for  Instance,  AJcxa 
points  the  user  to  sites  for  Fed- 
eral Express,  the  U.S.  Postal 
Service  and  DHL  Worldwide  Ex- 
press Starting  at  the  Web  page 
of  catalog  clothing  retailer 
Lands  End.  Sites  for  J.  Crrw. 
L.L.  Bean  and  Victorias  Secret 
are  suggested. 

Sometimes  It's  spooky  how 
helpfjl  It  con  be,"  said  Brewster 
Kahle,  the  archive's  founder 
and  the  president  of  AJexa  In- 
ternet Inc..  a  San  Francisco 
start-up  company  that  intends 
to  offer  the  new  navi^tlon  ser- 
vice free  to  Web  users. 

This  IS  not  the  first  ttme 
someone  has  claimed  to  have 
cracked  the  code  for  making  the 
Web  easily  searchable.  And  it's 
not  clear  whether  AJexa,  which 
is  being  tested  by  about  10,000 
people  now.  would  function  well 
If  millions  of  people  tried  to  use 
It.  But  In  Its  early  fonn.  It's  get- 
UDg  generally  upbeat  reviews. 

Search  engines  can  drown 
you  In  documents,  ■  said  Jerry 
Mlchalskl,  managing  editor  of 
Release  1.0.  a  New  York-based 
computer  industry  newsletter. 

"Alc^ca  Is  a  big  step  forward." 
MlchaJski  said.  ' Its  like  having 
a  huge  associative  memory 
that's  looked  at  most  of  the  In- 
ternet" 

M 


How  Alexa's  searches  work 


TW€  WASHINGTON  POST 

If  you  want  Information  on 
shipping  a  package,  using 
"shipping"  as  a  keyword  for 
a  conventional  search  engine 
will  produce  hundreds  of 
thousands  of  sites, 

Alexa's  technology. 
starting  from  the  UPS  site, 
will  look  for: 

>■  Similar  words  and 
phrases  at  other  sites 

>  Other  Web  sites  that 
have  'linked"  to  the  UPS  site 
and  include  Information  on 
other  relevant  home  pages. 


►  TraiHc  patterns.  Alcxa 
tracks  where  users  were  be- 
fore arriving  at  UPS  and 
where  they  went  afterward. 
Integrating  these  patterns 
Into  Its  guidance  system. 

After  processing  the  above 
information.  Alcxa  suggests 
10  sites. 

Alexa  also  provides  a  brief 
dossier  for  each  suggested 
site,  such  as  the  company 
running  the  site,  the  speed  of 
Its  computers,  the  number  of 
pages  In  the  site  and  a  rating 
from  other  Alcxa  users. 
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y&H[BrewsterKahle 


Talks  of  Plan  For  a  Oratory 


By  Whit  Andrews 

BREWSTER  K-'VHLE'S  WAIS  InC. 
promoted  the  idea  that 
"search"  was  as  big  a  part  of  ir\- 
teractivitj'  as  "click"  in  the  early 
1990s,  a  notion  that  still  is  gaining 
currency  as  search  blanks  come  off 
back  pages  and  onto  the  toolbar. 

Kahle's  new  company — he  sold 
WAIS  to  America  Online  in  1995  for 
$15  million— is  Alexa,  which  aims 
to  promote  something  even  more 
\isceral.  along  the  lines  of  "Show 
me  where  I  am  and  where  I  should 
go."  The  earliest  version  of  this 
technology  was  a  sizable  download 
api)lic-ation;  it's  been  pared,  but  the 
ultimate  goal  is  to  create  a  skinny 
Web  service  with  no  technology' 
banier  between  a  user's  wanting  it 
and  his  using  it. 

Wl  letlier  Alexa  is  an  idea  tltat  wlU 
aid  newbies,  its  target  market,  in 
penetrating  tlie  Internets  mysteries 
is  loo  soon  to  say.  But  Kahle  has 
wiilked  litis  path  before,  and  it's  in- 
teresting what  he  sees  as  being  \oiy 
different  this  timeout. 

Internet  World:  What  has  clumged  in 
terms  of  getting  in  the  public  eye 
since  you  got  in  a  smaller  public's 
eye  with  WAIS? 

Brewster  Kahle:  Back  in  the  WAIS 
days,  it  was  a  concept  sell,  it  was  a 
x'ision  tiling.  It  was,  "E\'er>'body's  go- 
ing to  become  a  publisher,  it's  going 
to  be  on  the  Internet,  and  they're  go- 
ing to  u.se  W-MS."  Tlie  way  you  sold 
that  back  then  was  top-down.  So 
[Vice  President]  Al  Gore  was  a  pro- 
pon(Mit  of  WAIS.  I  went  category'  by 


categoiy  and  said,  "\\nio's  the  top 
pubhsher  in  that  field?"  and  1  shot 
for  it.  If  you  could  sell  those,  then 
the  rest  would  foUow. 

That's  completely  different  from 
what  we're  seeing  now.  Where 
Alexa  has  got  the  sort  of  coolness 
that  lands  us  on  NPR  and  that  t>pe 
of  progranmiing — and  that  helps  in 
the  public  relations,  to  have  a  big 
concept  associated  with  you — con- 
sumers couldn't  care  less,  really.  If 
it's  not  something  they  can  use 
right  then,  and  that  works  really 
well,  and  that  doesn't 
crash,  it's  not  in  their 
sphere.  They're  a  hardei 
group  to  sell,  in  fact,  thiui 
AI  Gore  and  the  head  of 
The  New  York  Times. 


fW:  Do  you  tltink  tlie  ethic 
has  changed  in  terms  of 
must-not-crash? 
Kahle:  We'\e  moved  from 
a  technology'  demons! ra-  Brewster 
tion  phase  to  "Let's  have 
it  be  useful  in  what  I'm  tning  to 
do."  People  are  starting  to  say  that 
the  aitswer  is  out  on  the  Net  some- 
place, but  you  can't  find  it.  That 
change  is  recent,  in  tiiat  we've  hit  a 
critical  mass,  and  people  ar-e  now 
applying  this  technology  to  their 
everyday  problems.  We  in  tlie  Inter- 
net are  not  there  yet. 

IW:  In  both  cases,  WAIS  and  Alexa, 
you're  promoting  an  intellectual  ex- 
perience (!iat  to  some  degree  is  be- 
yond i!r.;i;iuiation,  except  that  of 
the  mo: .:  visionan'  people.  You  suc- 


ceeded with  WMS  in  being  part  of 
this  new  world.  We  all  know  what 
"search"  means  now.  How  do  you 
cross  that  barrier  and  gain  critical 
mass  \\ith  Alexa? 
Kahle:  Probably  analogy.  Wlmt 
we're  trying  to  do  with  Alexa  is  be 
an  onscreen  directory,  which  is 
close  to  what  we  have  seen  on  the 
500-channel  satellites.  My  father, 
who  hates  the  Web  because  it's  so 
disorgaitized,  has  this  satellite  sys- 
tem, and  one  of  the  things  he 
wanted  to  show  me  the  most  was 
the  director^'.  It  wasn't  just  "Chan- 
nel :3  is  CBS."  It  was,  "This  is  CBS, 
and  here's  what's  showing  right 
now."  So  we  call  it  an  onscreen  di- 
rectory, and  that  really  has  worked 
for  a  lot  of  the  newbies. 

IW:  I  know  that  money  can  be  a  ma- 
jor issue  for  a  concept  play.  Are  you 
feeUng  pressui'e  from  investors,  or 
customers,  who  want  to  see  you 
making  money? 

Kahle:  Yes,  people  are  going  to  want 
to  make  sure  you're  go- 
ing to  make  money.  .\nd 
it's  not  just  customers. 
It's  employees,  and  all 
of  these  sorts  of  impor- 
tant people.  I  find  it's 
helpful  that  I've  been 
].iart  of  a  concept  that 
went  from  "You've  got  to 
be  cra2>"  to  "Of  course!" 
once  before.  The  thing 

Kahle  that  helps  now  is  all 

the  IPOs. 


(W:  Do  \'ou  find  the  enxironment  for 
getting  developers  and  strategists 
different  from  1992? 
Kahle:  It's  so  much  easier  now. 

IW:  W\^y  is  that'  Usually  I  hear  peo- 
ple say,  "I  can't  hire  anybody." 
Kahle:  You  compared  it  to  1992. 
Around  1995,  we  saw  the  start  of 
the  flood  of  talented  people.  1995, 
1996,  and  1997,  we've  seen  an  im- 
migration into  Ciilifomia,  and  lot  of 
people  trying  to  make  their  fortune. 
search:  AGENTS,  SEARCH  SERVICES 
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Seeing  the  Sites 
On  a  Custom  Tour 

New  Litemet  St^arch  Ibol  Takes  Selet'tive  Approach 


By  Rajiv  Chandrasek^an 


S.V^-  FRANCISCO 

For  more  than  a  year,  a  team  of 
computer  industry  entrepre- 
neurs has  been  using  a  roumftd 
of  computers  to  copy  every 
World  Wide  Web  pasje  they  can  lind. 

Their  massive  digital  library  origi- 
nally was  viewed  as  a  quixotic  venture 
to  assemble  an  Internet  tunc  capsule 
of  the  global  computer  nefwork's  early 
days. 

But  the  electronic  librarians  appear 
to  have  found  a  more  immediate — and 
potentially  profitable — use  for  their  2 
tnllion-charader  pile  of  data:  a  new 
and  precise  way  lor  ordinary  comput- 
er users  to  navigate  tlie  sprawling 
Web. 

To  find  what  they  re  looking  for  on 
the  Web,  most  computer  users  plug 
"keywords"  into  search  engines — 
Web  sites,  with  names  such  as  InJo- 
seek  and  Excite,  that  look  tor  other 
Web  sites  contamuig  that  keyword  and 
then  generate  a  list  of  sites.  That  can 
produce  a  long  list  to  wade  through. 


and  the  search  engines  otten  imss 
relevant  sites  because  the  keyword  is 
slightly  olf  target 

But  the  Internet  ."Xrchive's  ser 
vice— called  .^exa.  aflcr  the  ancient 
library  at  .Alexandna.  Egypt— doesn't 
rely  on  a  word  search 

Instead,  .Alexa,  a  software  add  on  for 
Web  browsers  that  can  be  downloaded 
fi'om  litip..' '  www.alcxa.com.  uses  sev- 
eral powerfid  computers  to  crunch 
through  the  archived  data — a  process 
called  data  mining— and  look  tor  pat- 
terns within  the  Web,  ultimately  pro- 
viding a  few  recommended  strands  to 
travel. 

From  Umted  Parcel  Service's  Web 
site,  for  instance,  Alexa  points  the  user 
to  sites  for  Federal  Express,  the  U.S. 
Postal  Service  and  DHL  Worldwide 
Express.  Starting  at  the  Web  page  of 
catalogue  clothing  retailer  Lands'  End, 
sites  for  J  Crew,  L  L  Bean  and  Victo- 
rias Secret  are  suggested 

"Sometimes  iis  spooky  how  helpful 
it  can  be."  said  Brewster  Kahle,  the 
archive's  foiun.;-!  ;,;id  itie  president  of 
St<".\K..!(;VK.I-H,C<>l.  1 


Ifyuii  tvcint  ni/unnatiuii  on  slap- 
ping a  package,  using  "fshippnig"  a^ 
a  keyword  for  a  ronreiitional  ifarcii 
engine  will  produce  hundreds  of 
thousands  of  sites,  including  i'F:^'-<. 
Alexa's  technology,  starling  from 
the  UFS  site,  will  look  for: 

■  Similar  words  and  phrases  at  other  sites. 

For  .nstance,  Alexa  nugnt  Hag  "de  .ers 
the  package"  or  "enter  your  track, ng 
number"  from  the  UPS  site  and  d.rect 
you  to  FedE,<'s  site,  whicfi  also 
conta  ns  phrases  close  to  that 

■  Otiier  Web  sites  that  have  "linked"  to  the 
UPS  site  and  include  information  en 
other  relevant  home  pages 

■  Traffic  patterns.  Alexa  tracks  where 
users  were  before  arriving  at  UPS  3",J 
where  they  went  afterward,  integral. hg 
these  patterns  into  its  guidance  systeni 
It  Federal  Express  were  the  most 
common  site  where  users  went  next  t 
would  likely  appear  as  a  suggested  s.te 
to  move  to. 

.Ajitr pmcessiiig  the  nhmc  iiifoiiiiatu'ii 
Alexu  suggests  Ulsilcs^  Fiviii  the  CFi, 
sOe.  .■Mmi  recently  steered  users  tu- 
rn FedEx 

■  DHL  Worldwide  Express 

■  US,  Postal  Service 

■  Bellinghani,  Wash  ,  Transportation 
Department 

■  P&O  Nediloyd,  a  Dutch  conta  ner 
shipping  company. 

Alexa  also  provides  a  brief  doss.er  'cr 
each  suggested  site,  such  as  the 
company  running  the  site,  the  speed  of 
its  computers,  the  number  of  pages  'n 
the  site  and  a  rating  from  other  Alexa 
users. 
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Kahie  is  the  man  to  watch 


Gannett  News  Service 


SAN  FRANCISCO  ~  When  he 
really  gels  going  about  his  ideas, 
such  as  archiving  the  entire  Inter- 
net or  mapping  the  Web  by  looking 
at  the  trails  users  leave,  Brewster 
Kahle  can  sound  like  Robin 
Williams  —  if  Williams  had  gone  to 
MIT 

Kahle  hunches  down  beside  a 
Coke  machine-size  box  full  of  com- 
puter tapes  that  together  hold  the 
equivalent  of  about  50,000  books.  His 
hands  leap  into  the  air,  his  voice 
shifts  octaves  and  suddenly  he's  two 
supercomputers  trying  to  find  a 
speed  at  which  they  can  talk 

"'How's  this  for  you'''  Faster,  lots 
faster.'  'Fine,  how's  this?'  'Ncnoise 
on  this  end.'  'Great,  I'll  start  ship- 
ping data  now'  'Fantastic.  I'm 
ready!"" 

Described  by  Microsoft's  Chief 
Technical  Officer  Nathan  Myhrvold 
as  a  crazed  lunatic,  briUiant  vision- 
ary and  nice  guy  all  rolled  into  one, 
Kahle  has  always  thought  big 

In  the  early  1980s,  he  was  a  scien- 
tist at  a  company  called  Thinking 
Machines,  where  he  designed  mas- 
sively parallel  computers,  powerful 
machines  made  up  of  100,000  small 
computers  connected  by  a  fast  net- 
work They  broke  huge  problems 
into  small  bits  that  could  be  solved 
simultaneously. 

Now  Kahle,  36.  is  turning  his 
expertise  in  the  analysis  of  really 
big  amounts  of  data  to  the  problem 
of  finding  things  on  line.  His  new 
company,  Alexa  Internet,  Is  a  World 
Wide  Web  navigation  service  that 
gives  users  information  about 
where  they  are  and  also  recom- 
mends where  to  go  next. 

At  his  offices  in  a  renovated  Victo- 
rian general  store  in  the  former 
Presidio  Army  base  here,  Kahle 
talks  a  visitor  through  the  service, 
his  blue  eyes  glowing  with  pride 
under  a  cloud  of  curly  blond  hair 

Like   a   trained  tracker.    Alexa 


(http //www. alexa  com)  helps 
guide  users  roaming  the  Web 
Named  for  the  lost  library  in 
ancient  Alexandria,  it  creates  a  thin 
toolbar  that  shows  a  constant 
stream  of  information:  who  regis 
tered  the  site  you're  at,  how  often 
it's  updated,  how  many  pages  It  con- 
tains. 

But  it's  as  a  guide  that  Alexa 
shines  Using  anonymous  data  from 
Net  traffic  nodes,  the  service  sees 
what  paths  others  have  taken  and 
offers  them  to  users  as  a  small  pop- 
up list,  ranked  by  which  links  were 
most  heavily  traveled 

-It's  a  sort  of  chatty  navigator 
that  in  some  me:aphorical  sense  has 
talked  with  a  \(>i  of  people  and  can 
give  advice  about  routes  to  take." 
says  Jerry  Michalski  of  Release  1.0. 
an  mdustry  ncv-sletter 

And  users  never  have  to  see  the 
dreaded  "Error  404  —  page  not 
found"  message.  If  a  Web  page  no 
longer  exists,  Alexa  will  find  a 
recent  copy  in  Kahle's  Internet 
Archive  and  serve  it  up.  Alexa  must 
be  downloaded  to  be  used;  It's  ad- 
supported  and  free  to  users. 

Both  Alexa  and  the  archive  spring 
from  Kahle's  fascination  with 
libraries  as  founts  of  information 
His  mterest  dates  to  the  late  1980$, 
when  he  developed  the  Wide  Area 
Information  Server,  a  pioneering 
Net  publishmg  system 

Although  HTML  won  out  as  the 
primary  publishing  format,  the 
method  Kahle  devised  for  indexing 
the  Net  became  one  of  the  most  pop- 
ular lookup  tools  of  the  time,  so  pop- 
ular that  in  1995,  America  Online 
paid  $1S  million  for  it.  That  gave 
Kahle  the  funds  to  pursue  his 
dreams,  one  of  which  was  preserv- 
ing the  digital  past. 

No  one  who  knows  Kahle  is  sur- 
prised that  when  he  decided  there 
should  be  an  archive  of  the  Internet, 
he  just  sat  down  and  made  one.  He 
tends  to  pursue  his  passions  whole- 
heartedly, whether  they're  techni- 


cal problems  to  be  solved  or  social 
networks  to  be  knit. 

Take  the  Thursday-night  poUuck 
dinners  Kahle  and  his  wife,  Mary, 
have  held  for  the  past  10  years.  - 
Each  includes  a  question,  "What's 
the  moat  interesting  game  you've 
ever  played?'  or  "What's  the 
strangest  place  you've  ever  slept?" 
Every  guest  is  obhged  to  answer  in 
the  form  of  a  story 

Kahle,  who  graduated  from  MIT 
with  a  degree  in  artificial  intelli- 
gence and  Eastern  religions, 
delights  in  inviting  newcomers  to 
these  meals,  the  couple's  answer  to 
the  problem  of  meeting  interesting 
people  once  they'd  left  college 

It's  his  engineer's  let's-make- 
something-to-fix-this-problera  atti- 
tude that  has  gotten  Kahle  so  far. 
But  sometimes  his  enthusiasm  for 
ideas  causes  him  to  overlook  practi- 
cal considerations,  observes  one  of 
the  fathers  of  the  Internet,  Vinton 
Cerf,  now  at  MCI. 

Kahle  has  sidestepped  the  loom- 
ing issues  of  copyright  and  privacy  I 
raised  by  copying  Web  pages  with-  i 
out  their  creators'  eiprcss  permis- 
sion You  can  always  take  a  page 
down,  but  if  it's  in  the  archive,  any- 
one can  still  access  it.  Special  cod- 
ing can  be  included  to  prevent  a 
page  from  being  archived  or 
indexed,  but  many  people  don't 
know  this  , 

Kahle  says  that  by  not  worrying  , 
about  the  details  he's  able  to  do  . 
things  others  think  are  impossible . 

And  that  alone  is  enough  to  make 
Cerf,  someone  who  knows  a  thing  or 
two  about  big  plans,  respect  Kahle: 
'He's  contributed  more  than  his  fair 
share  of  interesting  and  innovative 
ideas" 

Says  Cerf,  "I  think  Brewster  is  the 
kind  of  visionary  who  bears  watch- 
ing" I 
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Alexa  Internet  [nc ,  a  San  Francisco 
I  start-up  company  that  intends  to  offer 
the  new  navigation  service  for  free  to 
Web  users 

'  This  is  not  the  first  time  someone  has 
claimed  to  have  cracked  the  code  for 
making  the  Web  eabily  searchable  And 
it's  not  clear  whether  Alexa.  which  is 
being  tested  by  about  10.000  people 
now,  would  function  weB  if  millions  of 
people  tried  to  use  it  But  in  its  eariy 
form  it's  getting  generally  upbeat  re- 
views in  the  computer  industry 

"Search  engines  can  drown  you  in 
documents,"  said  Jerry  Michalski.  man- 
aging editor  of  Release  1  0.  a  New 
York-based  computer  industry  newslet- 
ter 

,\sking  the  Infoseek  engine  to  find 
sites  with  the  keyword  "shipping."  for 
example,  generates  more  than  291.000 
citations,  with  sites  for  a  British  port  and 
an  exotic-bird  store  at  the  top  of  the  list, 

".Alexa  is  a  big  step  forward."  Michal- 
ski said  "Its  like  having  a  huge  associa- 
tive memory  that's  looked  at  most  of  the 
Internet" 

Alexa  works  largely  by  looking  for 
Web  sites  with  suiular  content,  then 
linking  them  together  A  user  starts  with 
one  Web  site,  and  then  the  data-mining 
computers  search  for  other  pages  that 
have  several  similar  words  and  phni-ses 
The  computers  suggest  sites  that  are 
most  closely  related 

People  who  create  Web  pages  also 
include  links  to  other  pages,  but  those 
links  are  often  limited— Company  A 
doesn't  want  people  visiting  its  page  to 
go  to  the  page  of  its  competitor  Compa- 
ny B.  and  so  it  includes  no  link  Alexa. 
however,  tries  to  function  as  an  objective 
source  .is  to  what  pages  should  be  tied 
together 

Tlie  machines  also  troll  through  the 
maze  of  connected  pages  that  make  up 
the  Web,  hunting  for  the  most  relevant 
sites  that  have  links  to  the  user's  start- 
ing page 

Those  suggestions,  offered  in  a  small 
box  at  the  bottom  of  a  user's  screen,  are 
shaped  by  the  surfing  patterns  of  other 
Alexa  users,  Kahle  said  The  sites  they 
choose  to  jump  to  are  tracked,  those 
patterns  wiD  influence  the  10  sites  pre- 
sented in  the  bfix 

For  example,  if  many  Alexa  users  at 
the  UPS  Web  site  don  t  jump  to  the 
FedEx  site,  it  could  be  removed  from 
the  boxed  list,  even  though  the  two  sites 
are  very  common,  Kahle  said, 

,\t  the  same  time,  if  .Mexa  users  at  the 
UPS  site  tend  to  mow  to  the  .AFL-CIO's 
Web  page,  that  site  could  be  suggested 
to  future  Ale.xa  users  visiting  the  UTS 
site,  even  though  the  two  aren't  associat- 
ed in  any  way  beyond  the  AFLrCIO's 
support  for  the  Teamsters  'in  their  re- 
c  '  strike  against  UT^. 

-V  like  asking  somebody  if  they've 


Under  a  "Where  You  Are"  box.  Alexa  provides  readily  accessible  Information  on 
the  site  you're  visiting,  such  as  company  revenue  and  a  rating  of  the  site. 


read  any  good  books  lately."  Kahle  said. 
He  said  that  because  of  privacy  con- 
cerns, Alexa's  computers  will  not  keep 
records  of  users'  names  as  they  jump 
fi-om  site  to  site 

Some  early  users  question  whether 
the  service  is  detailed  enough  Al- 
though it  will  suggest  whole  Web  sites, 
it  can't  suggest  specific  Web  pages  that 
are  similar  to  the  page  a  user  has  on  his 
screen. 

The  database  is  a  bttle  immature." 
said  Tiernan  Ray  an  analyst  with  Tech 
nologic  Partners  in  New  York  "It  cant 
give  you  much  information  about  a 
specific  page  buried  several  layers  into  a 
site " 

And  its  suggestions.  Michalski  said, 
"might  take  you  to  something  related, 
but  very  random," 

From  the  Wiite  House  Web  site,  for 
example,  Alexa  suggests  not  just  the 
House  of  Representatives,  the  FBI  and 
the  Library  of  Congress,  but  also  PC 
Magazine  and  a  service  to  look  up  stock 
quotes 

"It's  not  ahways  perfect."  Kahle  con 
ceded,  "But  we  think  it's  a  big  improve- 
ment fi-om  the  other  ways  of  searching 
the  Net  today," 

Part  of  Alexa's  expected  appeal, 
.Michalski  said,  is  its  usefulness  both  in 
finding  specific  infnrmabon  and  in  sim- 
ply cruising  tlie  Web 

Tou  gel  this  list  of  sites,  and  some  ol 
them  are  going  to  be  right  on  the 
money  And  you  can  just  hop  to  them 
vfithout  having  to  go  back  to  your 
search  engine,"  he  said 

In  addition  to  providing  "where  to  go 
next"  advice,  the  Alexa  service  can  tap 
into  another  facet  of  the  archive — its 
snapshots  of  the  Web's  past.  Web  pages 
of  the  Heaven's  Gate  cult,  for  instance, 


are  no  longer  actively  offered  on  the 
Internet  but  can  be  culled  fi-om  the 
archive  here 

The  service  also  tells  users  basic 
facts  about  the  site  they're  visiting, 
including  which  company  runs  the  site, 
how  fast  its  computers  are  and  how 
many  Web  pages  the  site  contains 

"It's  very  useful  information,"  Ray 
said  "It's  something  that  most  Web 
surfers  have  no  idea  about  today" 

Alexa.  which  has  received  .S5  million 
in  ftinding  fi-om  Etoile.  the  Swiss  in- 
vestment company  that  owns  Encyclo- 
paedia Britannica  Inc..  plans  to  make 
money  trom  the  service  through  ad- 
vertising, which  will  apfjear  in  the  box  at 
tlie  bottom  of  the  scTcen 

Kahle.  who  started  /Vexa  with  former 
Dow  Jones  &  Co  executive  Bill  Dunn, 
eventually  plans  to  merge  some  of  the 
encyclopedia's  content  with  Alexa's 
Part  of  the  profit  Kahle  hopes  Alexa  will 
generate  could  be  used  to  exp.ind  and 
m.aintain  the  archive 

The  service  already  has  about  10.000 
trial  users  and  has  the  capacity  to  handle 
I  million  users,  Kahle  said, 

•Alexa  also  wiD  let  its  users  vote,  with 
one  mouse  click,  on  whether  they  like  a 
particular  site.  Those  ratings  wiD  be 
offered  to  users, 

■The  Web  allowed  everyone  with  a 
computer  to  become  a  publisher,"  Kahle 
said  "Alexa  allows  them  to  become 
editors " 

FOR  MORE  INFORMATION    ^ 

To  tour  the .  l/cj<2  ute  andprei-irw  its 
search  metUnil,  click  on  the  above 
symbol  on  the  front  l>age  ofVie  Post's 
\lrb  site  lit 
http:,' ,  wu'w.wa!Jiutgtonpost.com 
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Kiss  That  404  Goodbye 

by  Chris  Jones 

3  09pm  22  Sep  97.PDT  What  does  one  do  with  an 
archive  of  the  entire  public  content  of  the  Web?  A 
new  navigation  service  debuted  today  that  aims 
to  answer  that  question  by  giving  users  a  free 
toolbar  with  which  to  explore  a  more 
comprehensive  yet  focused  rendering  of  the 
Internet. 

The  Alexa  Internet  navigation  service  is  a 
searching  and  reference  tool  that  helps  users  find 
relevant  Web  sites  and  research-related 
information  via  the  Britannica  encyclopedia  and 
Webster's  dictionary  and  thesaurus.  The  Alexa 
tool  recommends  sites  based  on  previous  usage 
patterns,  hyperlink  information,  and  data 
collections  from  other  sources.  This  metadata  is 
analyzed  by  Alexa's  data  mining  and  collaborative 
filtering  engines,  but  rather  than  returning 
hundreds,  or  even  thousands,  of  URLs  -  a  la 
AltaVista  -  the  tool  distills  10  sites  it  deems  most 
relevant. 

"We're  not  competing  with  directory  sites  like 
Yahoo.  We  try  to  keep  you  on  track  by  giving 
more  precise  recommendations  based  on  what 
other  people  say.  It's  more  precise  than  other 
search  engines,  and  has  a  much  higher  coverage 
of  the  Web,"  said  Brewster  Kahle,  co-founder  of 
Alexa. 

Once  downloaded,  the  Alexa  navigation  bar 
launches  in  sync  with  a  browser,  and  consists  of 
a  handful  of  icons  encapsulated  in  a  slim, 
rectangular  box.  The  toolbar  supplies  information 
about  a  site  -  who  it's  registered  to,  how  many 
pages  it  contains,  how  frequently  it's  updated  - 
and  recommends  other  sites  that  include  similar 
information.  The  tool  also  delivers  ratings  -  XXX, 
for  instance  -  that  reflect  the  content  of  the  site, 
and  includes  TRUSTe  ratings  that  rate  the 
privacy  and  data-collection  policies  of  a  site. 

Kahle  has  in  the  past  few  years  been  involved 
with  the  nonprofit  Internet  Archive  project,  which 
aims  to  archive  as  many  files  as  possible  -  from 
the  Web  and  FTP  sites,  Usenet,  and  other 
sources.  Wth  about  640,000  sites,  and  100 
million  pages,  Kahle  said  the  system  has  7 
terabytes  of  information  stored.  But  with  the 
number  of  sites  on  the  Net  doubling  every  six 
months  -  and  existing  sites  adding  new  depths 
and  layers  all  the  time  -  keeping  pace  becomes  a 
daunting  task. 

One  advantage  of  having  a  full  archive  of  the 
Web  is  that  if  a  user  gets  the  dreaded  "404  - 
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The  Seybold 
Report  on 
Internet 
Publishing 

8/97 

Matt  McKenzie 

Site  Reviews:  Alexa 
Internet 

Media  Central 
Digest 

7/29 

Sean  Wolfe 

Alexa:  Merging  Web 
Searching  and  Group 
Experiences 

Internet  Outlook 
(http://www. 
webreference, 
com) 

7/28 

Richard  Wiggins 

Alexa  Keeps  Surfers  on 
Beaten  Path 

Inter@ctive  Week 
(print) 

7/28 

Tom  Steinert-Threlkeld 

Alexa  Searches  While 
You  Surf 

PC  Week 

7/28 

Jim  Kerstetter 

Blacks  Say  Internet 
Offers  Equality 

The  Plain  Dealer 
(Cleveland,  OH) 

7/28 

From  staff  and  wire 
reports 

Officials  See  No  End  in 
Sight  for  the  Popularity 
of  the  Internet 

Post-Bulletin 
(Rochester,  MN) 

7/28 

Jon  Swartz  (SF 
Chronicle) 

New  Service  Tracks  Web 
Use 

Plainview 
Herald 

7/28 

New  York  Times 
Service 

Revolution  Brewing  at 
Presidio:  Alexa  Internet 
hopes  to  change  the  way 
you  use  the  Net 

San  Francisco 
Examiner 

7/27 

Zachary  Coile 

Net  Growth  Virtually 
Exploding 

Austin  American 
-  Statesman 
(Austin,  TX) 

7/26 

Jon  Swartz  (SF 
Chronicle) 

The  Internet  Boom  Is  Just 
Beginning:  Big  business  is 
the  next  frontier, 
attendees  at  Chicago 
trade  show  say 

San  Francisco 
Chronicle 

7/25 

Jon  Swartz 

Getting  Even  More 
Personal... 

Interactive 
Services  Report 

7/25 

No  byline 

Headline:                      Publication:           Date:                  Author:                1 

E-commerce  about  to 
launch? 

The  Middletown 
Press 

(Middletown, 
CT) 

7/25 

Jon  Swartz  (SF 
Chronicle) 

Outside  The  Box 

San  Jose  Mercury 

News 

7/24 

Chris  Nolan 

As  Go  Surfers,  So  Goes 
Alexa 

Wired  News 

7/24 

Chris  Oakes 

Internet  Industry  Giants 
Snub  Expo 

The  Desert  Sun 
(Palm  Springs, 
CA) 

7/24 

Jon  Swartz  (SF 
Chronicle) 

2  Internet  Giants  Won't 
Show  at  Expo: 
Microsoft,  Netscape  are 
skipping  Chicago 

San  Francisco 
Chronicle 

7/23 

Jon  Swartz 

New  Ad-Backed  Service 
to  Monitor  Context  of 
Web  Pages 

Internet 

Advertising 

Report 

7/23 

Compiled  by  Beth  Cox 

Service  Turns  Page  in 
Aiding  Web  Users 

The  Oakland 
Press  (Pontiac, 
MI) 

7/23 

New  York  Times  News 
Service 

New  Guide  Offers  Net 
Directions 

Inter@ctive  Week 
(online) 

7/22 

Tom  Steinert-Threlkeld 

New  Search  Tool  Tracks 
User's  Path  Through 
Internet 

The  San  Diego 
Union-Tribune 

7/22 

New  York  Times  News 
Service 

News.Scan:  Today's 
Headlines  from  the 
Mainstream  Press 

PC  Magazine 
Online 

7/22 

NewsWatch 

Looking  for  a  Second 
Commercial  Hit, 
Brewster  Kahle  Launches 
a  Navigation  Service 

Web  Week 

7/21 

Margaret  McKegney 

Alexa  Internet  Lands  at 

San  Francisco 

7/21 

Steve  Ginsberg 

Headline:                      Publication:           Date:                  Author:                1 

the  Presidio 

Business  Times 

New  Service  Tracks  Web 
Use 

Nando.net 
(Reprint  of  NY 
Times  Article) 

7/21 

John  Markoff 

New  Service  Tracks  Web 
Use 

New  York  Times 

7/21 

John  Markoff 

Alexa  Internet:  The 
Search  as  a  Communal 
Effort 

New  York  Times 

(CyberTimes 

Extra) 

7/19 

Laurie  J.  Flynn 

Recent  &  Decent  Web 
Tools 

Family  PC  News 
Online 

7/18 

Michelle  Megna 

Alexa  Browser 
Companion  Makes 
Searches  More  Certain 

PC  World  Online 

7/18 

Lisa  Moskowitz 

Re-engineering  Surfing 

San  Jose  Mercury 
News  Online 
(link  to  TechWeb 
story) 

7/18 

Patricia  Sullivan, 
Online  Editor 

Internet  Historian 
Creates  Tool  to  Inform 
Web  Site  Visitors 

ZDNN--  The 
ZDNet  News 
Channel 

7/18 

Renee  Deger 

Alexa  Debuts  Search 
Engine  That  Learns  from 
Its  Users 

ZD  Internet 

MegaSite 

Magazine 

7/18 

no  byline 

Startup  Seeks  To 
Synthesize  Web 

TechWeb 

7/17 

John  Gartner 

Alexa  Internet  Introduces 
Web  Navigation  that 
Learns  from  People 

LA  Times, 
BizWire  Section 

7/17 

Press  Release 

Alexa  Makes  the  Web 
History 

C/Net 

7/9 

Alex  Lash 

Personal  Access: 
Information  abounds  on 
the  Internet,  which  means 
virtually  anyone  can  find 
out  virtually  anything 
about  someone  else 

Tucson  Citizen 

6/30 

Leslie  Miller 
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Once  Sojourner  landed  on  Mars,  one  of  the 

important  directives  that  its  Eaith-bound 

project  engineers  had  to  relay  to  the  rover 

each  day  was  which  way  to  head  out. 

Specifically  they  had  to  determine  how  it 

could  most  efficiently  reach  the  coordinates  of  its  destination.  ^Si  To  that  end,  engineers 

created  a  trajectory  for  the  rover  by  evaluating  a  complex  matrix  of  images  captured  on  a 

terrain  panorama,  coupled  with  data  compiled  from  multiple  hazard-detection  devices. 


Unearthing 

the  Web's  New 

Pathfinders 


Navigation— ^o  NASA  at  least— Is  truly  a 
science  ind  notsomeChlng  that  can  be  left 
In  chance  oi  urcumstance. 

Building  on  that  theory,  sojourners  on 
the  Web  could  also  benefit  from  keen  nav- 
igadon  devices.  These  come  in  two  forms: 
Intuitive  client-based  search  tools,  and  on 
the  server  side,  clear  pointers  within  a  site 
that  illuminate  where  usen  should  go  up- 
on arrival.  While  you  cant  always  plow  a 
path  for  users  through  your  Website,  at  the 
v-ry  least  you  should  open  all  the  hatch- 
es into  th*  sit»  and  funnel  traffic  into  the 
areas  you  want  explored. 

Fortunately,  great  strides  in  technology 
have  unearthed  a  bevy  of  pathfinders  that 
address  both  the  client  side  and  the  serv- 
er side  of  the  navigation  equabon.  On  the 
client  side,  one  smart  new  product  is 
Alexa  Internet's  navigation  service  (www. 
alexa.com).  Functioning  as  a  tiny  helper 
app  for  your  browser.  Alexa  provides  a 
stream  of  relevant  recommendations  of 
where  to  go  next.  As  the  service  matuies, 
so  will  the  quality  of  the  recommenda 
tiotu,  as  they  are  derived  from  Alexa's  on- 
going analysis  of  traffic  patterns — proxy 
caches  and  anonymous  usage  trails.  In 
essence,  Alexa  t>ecomes  the  Nielsen  sys- 
tem of  the  Net,  as  it  harnesses  the  collec- 
tive wisdom  and  taste  of  the  community. 

As  clients  move  smartly  from  site  to 


NAVIGATION  IS  TRULY 
A  SCIENCE  AND  NOT 
SOMETHING  THAT  CAN 
BE  LEFT  TO  CHANCE. 

site,  those  on  the  server  side  also  need  to 
know  how  users  are  rummaging  through 
Websites.  Tlaffic  analyzers,  entering  their 
savvy  second-generation,  are  a  must-have 
mvestment.  These  tcxjls  now  go  far  be- 
yond the  static  logs  of  the  early  days.  In- 
stead, they  generate  dynamic  tables  and 
graphs  compiled  from  sophisticated  path 
analysis  and  user  interaction. 

For  example,  Interlogue  Communica- 
tions iwww.inteTtogue.com)  Whirl  evalu- 
ates the  movement  of  users  across  a  site 
and  Identifies  the  optimal  path  to  a  desti- 
nation .  By  examining  users'  entry  and  ex- 
it points,  products  such  as  this  can  help 
you  understand  what  brought  people  to 
yout  site  and  what  caused  them  to  leave. 

AccAje  Software  Insight  (www.accrue. 
com)  offers  similar  live  navigationai  acu- 
men. During  a  recent  demonstration.  In- 
sight allowed  us  to  watch  a  user  plod 
through  the  Harley-Davldson  site  for  four 
hours,  all  during  the  day  while  he  v^as 
logged  on  from  his  government  research 
lab  address  (don't  tell  his  boss).  Insight  re- 


vealed where  he  had  been  before  his  mo- 
torcycle excursion,  how  long  he  waited  to 
view  bike  pics,  and  how  successful  he  was 
in  downloading  each  page  before  Web 
attention  deficit  disorder  kicked  in. 
These  are  only  two  examples  of  the 
many  smart  products  available,  and  there 
are  hundreds.  Use  them  In  tandem  with 
another  "live "  cool— the  expenences  of 
others  who  have  managed  to  capture  vir- 
tual customers.  In  "Secxets  of  Successful 
Web  Stores"  on  page  91,  Jaclyn  Easton 
describes  how  Ken  Crane  leveraged  his 
Southern  CA-based  laser  disc  chain  to 
build  an  e-business  that  has  grown  600 
percent  smce  1995  Crane's  distributor, 
Martin  Greenwald.  sums  up  the  secret  of 
Crane's  success — he  knows  his  customers 
and  their  Web-buying  ways.  And  thats  an 
arduous  task 
with  anony- 
mous, virtual 
shoppers.  Adds 
Greenwald;  "He's 
like  Wal-Matt. 
He  makes  money 
through  traf- 
fic, not  units.' 
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ON      THE      WEB 


FRONT     HOME     FINANCE    FOREIGN    FEATURES    SPORT    OPINION     LETTERS 
COMPUTIMES  Monday,  September  29,  1997 


MONITOR 


Death  of  '404s' 


Remember  the  Internet  Archive  (Computimes,  January 
27th)?  The  people  behind  the  non-profit  project  to  maice  a 
"snapshot"  of  the  Net  have  just  launched  a  very  nifty 
Internet  navigation  service.  Called  Alexa,  the  free  software 
(at  li[ip:/'/\v\\Av. alexa. com)  gives  a  toolbar  for  finding 
related  Web  sites,  and  other  icons  tell  you  information 
about  a  site  -  who  it's  registered  to,  how  many  pages  it 
contains  etc.  The  tool  also  includes  an  "instant  messaging 
service"  to  see  if  your  friends/colleagues  are  online. 

Best  of  all,  though,  Alexa  ties  in  to  the  Internet  Archive 
itself.  So  if  you  get  one  of  those  "404  -  document  not 
found"  messages,  the  Alexa  server  delivers  an  archived 
version  of  the  outdated  page  (if  it  has  it).  Currently  only 
available  for  Windows  95  and  NT  users,  but  Windows  3. 1 
and  Mac  versions  are  on  the  way... 
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NUA  DEAL:  The  American  Export  Group  has  given  Irish 
Internet  consultancy  Nua  the  contract  to  develop  a  Web 
site  for  a  database  of  47,000  US  export  companies  in  eight 
languages.  "To  win  such  a  contract  in  the  face  of  heavy 
competition  from  American  Internet  developers  proves  that 
Nua  is  becoming  a  serious  player  in  Internet  development 
internationally,"  says  Nua's  Gerry  McGovem. 

ONLINE  SHOPPING:  Ireland  On-Line  is  to  sell  its 
Internet  starter  kit  in  Dunnes  Stores  outlets  nationwide. 
The  £4.99  pack  includes  a  free  month's  full  access  to  email, 
the  Web  etc  and  connection  software.  It  will  be  sold  in 
over  500  retail  outlets  including  Dunnes,  Xtra-Vision  and 
HMV. 

APPLE  HARVEST:  Apple  launched  its  new  OS  8 
Macintosh  operating  system  in  Ireland  on  Friday,  and  will 
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be  displaying  it  at  the  Apple  Expo  in  the  RDS  next 
Wednesday  and  Thursday.  The  company  has  already  sold 
over  1 .5  million  copies  in  the  US,  and  it  has  a  user  base  of 
60.000  people  in  Ireland. 

GAME  OVER:  IBM's  Deep  Blue,  the  supercomputer  that 
beat  world  champion  Garry  Kasparov  in  May,  has  gone 
into  retirement.  A  less  powerful  version  called  Deep  Blue 
Junior  will  still  play  demonstration  games,  but  Kasparov 
said  from  his  home  in  Moscow  that  he  was  "very 
disappointed"  by  the  announcement.  He  had  challenged 
IBM  to  a  rematch. 

EMAIL  PROTECTION:  The  head  of  the  European 
Commission's  "free  flow  of  information"  unit,  Ulf 
Bruehann,  has  called  for  an  independent  EU  data 
protection  agency.  He  told  the  International  Conference  of 
Privacy  Data  Protection  Commissioners  in  Brussels  that 
the  agency  would  protect  citizens  against  the  increased 
risks  to  privacy  posed  by  the  growing  international  traffic 
in  personal  data.  He  said  a  key  area  to  be  monitored  was 
the  treatment  of  email. 

CNTRL+ALT+DEL:  Last  week's  Computimes  said  that 
PostGEM/IOL  was  the  first  Irish  Internet  service  provider 
to  join  LINX.  In  fact  Global  One  says  most  of  its  Irish 
customers,  including  Medianet,  have  been  connected  to 
LINX  since  1995. 

FASTER  BROWSING:  Intel  has  introduced  new 
technology  that  speeds  the  delivery  of  Web  pages.  Users  do 
not  have  to  add  software  or  hardware  to  their  PCs  because 
its  done  at  the  server  end.  Several  Internet  providers  plan 
to  conduct  trials  of  its  Intel  Quick  Web  Technology  next 
month.  It  uses  several  techniques  to  accelerate 
downloading  of  graphics.  The  software  analyses  Web 
pages,  searching  for  images  and  compressing  them  by 
removing  unnecessary  data  bits. 

SPAMMERS  EJECTED:  Anti-spammers  on  the  Internet 
say  a  major  battle  has  been  won  after  Sanford  Wallace's 
Cyber  Promotions  and  two  other  unsolicited  bulk  email 
operations  were  kicked  off  of  AGIS,  a  major  ISP  in  the 
States.  Spam-hunters  believe  AGIS  is  reconsidering  its 
open  policy  toward  hosting  unsolicited  bulk  email  vendors. 

HP  TELECOMMUTERS:  Up  to  40  per  cent  of 
HewiettPackard  Ireland's  sales  staff  are  to  work  from  home 
"or  other  non-office  locations".  The  teleworkers  will  have 
remote  access  to  HP's  huge  intranet  which  has  over 
100,000  users. 
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IN  BRIEF. . .  The  merger  of  accountancy  firms  Price 
Waterhouse  and  Coopers  &  Lybrand  will  also  create  the 
second  largest  technology  and  business  consulting  firm, 
with  Andersen  Consulting  maintaining  the  number  one 
position.  .  .  IBM  has  developed  a  way  to  substitute  copper 
for  aluminum  in  making  semiconductors,  using  a  patented 
"fusion  barrier"  that  keeps  the  copper  from  "poisoning"  the 
silicon.  .  .  Progressive  Networks,  which  makes  streaming 
media  tools  for  the  Web,  has  bought  a  major  Web  site  for 
film  reviews,  Film.com.  The  site  generates  two  million 
page  views  a  month.  .  .  Tests  by  Info  World  magazine  have 
found  that  100  per  cent  Pure  Java  applications  run  on  over 
a  dozen  platforms  including  Sun's  Solaris,  OS/2,  Windows 
NT,  Windows  95,  Windows  3.1  and  Macs.  .  . 

Canadian  software  firm  Corel  has  had  another  disastrous 
quarter  with  a  $3 1 .4  million  loss  on  sales  of  $55.8  million. 
A  further  loss  of  $15-$20  million  is  also  expected.  .  . 
Gateway  2000  has  introduced  its  Solo  2300  Multimedia 
Notebook,  priced  at  £2,730.  .  , 

Super  Value-Centra  has  awarded  IBM  a  £1.5-million 
contract  to  supply  a  computer  system  to  run  Ireland's  first 
centralised  distribution  network  for  chilled,  fresh  and 
frozen  foods.  .  .  Seagate  has  opened  its  new  plant  in 
Limavady,  bringing  its  Northern  Ireland  workforce  to 
1,500  people.  .  . 


MICROFILE 

In  a  survey  by  International  Data  Group: 

52%  of  companies  in  Asian  Pacific  countries  excluding 

Japan  believe  the  Internet  is  "important"  or  "very 

important"; 

30%  of  Japanese  companies  and  30%  of  US  companies  are 

in  this  category; 

only  16%  of  West  European  companies  share  this  view 

An  America  Online  survey  found  that: 

37%  of  its  subscribers  watch  less  TV  than  they  used  to; 

22%  watch  less  video; 

7%  watch  more  TV  and  6%  watch  more  video; 

the  Net  "has  had  no  effect  on  radio,  no  effect  on 

magazines,  and  little  effect  on  newspapers" 

According  to  Boardwatch  magazine,  approximate  no.  of 
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Alexa  1.0  Adds  Tools  and  Resources 

Losing  the  beta  blues,  Web  navigation  service  Alexa  (http;//www.alexa.coiTi)  has 
relaunched  its  service  and  beefed  up  the  content  available  to  its  users  through  an  alliance 
with  Encyclopaedia  Brittanica  (http://www.eb.com). 

The  new  alliance  aims  to  put  reference  materials  at  users'  fingertips  so  they  can  retrieve 
information  without  leaving  the  Web  page  they're  on. 

After  being  in  beta  testing  for  roughly  three  months,  Alexa's  new  version  (1.0)  features  a 
fully  debugged  toolbar  that  attaches  to  a  user's  browser.  The  toolbar  provides 
information  about  each  site  requested,  its  owners  and  relative  traffic.  If  the  site  sought  is 
off-line  but  otherwise  publicly  available,  users  can  usually  find  an  archived  copy  of  the 
page  on  Alexa's  database. 

Users  can  access  content  from  new  partner  Encyclopaedia  Brittanica  through  a  pop-up 
browser  window  launched  via  the  toolbar.  The  alliance  pairs  Encyclopaedia  Brittanica's 
32  volumes  and  some  5,000  with  an  online  dictionary  and  a  thesaurus  from 
Merriam-Webster  to  create  a  kind  of  reference  shelf. 

Cowles/Simba  Media  Daily  9/24/97 
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Kreserving 
a  moving  target 


The  Internet  Ardiives  hopes 

to  save  Web  content  =  all  of  it  - 

forposterihj 


dovor-ioftdac! 


By  Tom  Spring 

r     he  World  Wide 
uai  cydeofhere 
today,  gone  to- 
morrow The  In- 
:«iTiei  r^u  log 
onto  one  day  can 
vaiush  the  noct. 
Inst  m  the  cybe^ 

abj'ES  of  digits 

Yet,  M  cytwr- 
apace  growa  to  be 
e  publj!:  fbnao 
where  home- 
pBgw.  Web  anefl, 
and  scholarly  et- 
aays  reaide,  the 

reflection  of  our 
digital  hantagB 
experta  beUri/^ 
A^d  Uks  the  &nK 
bv  i^.  Slna.  «r.dev&o  aev.5pspei5  now  dasa>isi 
aril  unrecogruzflble,  history  threatens  to  repeat 
1L..II  with  the  World  Wide  Web. 

\,!ucb  h»j  b?*n  »*id  about 
arcruving  the  Intamot,  but  little 
h j:l  be«ri  done  -  ur.Sl  w«r.tly. 

About  a  year  ago.  a  nonprofit 
or^amzation  tist  the  firet  foot- 
pH.nts  of  the  In/brroatin.n  Agn  t-o 
tw  preset  ed  for«vw.  TlianKs  W 
ocnpuCcr  acientiat  and  miUjon- 
airc  erxtrepreneujr  Brewatcr 
KahJe,  th»  Internet  -  from  the 
iubluns,  to  tha  ndiruioiifl.  to  the 
bnHjanf  -  uriU  bs  preserved  ui 
It*  chaotic  e.xjctitwd'e 

S\no*n  as  The  Ultfimet 
^iKiUves,  the  organizatioo  haa 
CjUjatly  bsi^  the  dsuflti.^»  task 
of  creoun^ihe  first  and  only 
kno«  n  dirchive  of  th«  Web 

"!t  a  Ln!>y>rL.vit  thot  >ve  an?  able  to  look  bocl 
531*1  K-thlo  «  ho  invented  the  pion*ennjt  Net 
■■   ■  ■•  .h        , ,  .•,.„,  'a-..™  1-:  \V.\1.=;  •Wilis  An 


"CWto  someone 

starts  saumg  it  with 

eirry  day  that  passes 

wi'rs  losing  the  record 

of  one  of  the  great 

turning  points  in 

human  hisTonj. " 


are  out  of  pnnt.  .And  the  sarae  holds  tr\!9  for  tbs 
lot^met ' 

1  ne  aierag*  docunaeni  on  the  \»rV&  has  iiS«- 
span  of  ~o  dilys.  jmd  for  tha  reason  historuna 
and  tedmology  pro&esioaAls,  iike  MitneofV^ 
Chief  T£<iint!cg:f  Officer  Nathaa  Myhr^tstd,  S^ 
Uev'e  Kahle  9  projea  tB  vital  in  a  mcriiiim  that  is 
QeetHLg 

The  goal  lb  to  have  an  unprecedented  perma- 
neni  record  of  ail  pubUdy  aocesfflble  data, 
wbethei  it's  »  aeigilbsi-hacu  home  psae  •jt  a  foi- 
eignlanguagB  Usenet  pofftmg  about  Japaneaa 
haiku,  b£  *ajd 

lutJOD  fS  hou3«l  in  a  space  no  bigger  than  an 
av^ra^  iivmg  rvoni  5o  lar  Kahie  obs  stijred 
four  tarabytee  of  the  Internet  on  a  conbioactoo 
of  upe  and  hard  clnv»  >d  hi£  San  FrwiciKi; 
offie«  r-'irr'TOkLTs  tha  GoSdsa  Gate  Sndgs.  lb 
give  vDu  a  sense  of  breadth  of  the  library,  fcur 
terabyte*  IS  equal  to  4  tmlhon  raegahytea.  And 
despjie  the  fan  that  tho  "Nat  doublss  m  size 
evety  sue  oooaths,  aooordmg  to  KohJe  his  digital 
si^vii  ia  ready  to  atuffthe  whalis  thiisg  lata  his 
offices  for  decades  to  aome 
Not  IhAt  It  s  ftil  in  demand. 

"Mo?t  of  >t  !S  junk,'  h»  ad- 
mit*. Ht  views  hi»  rtoonls  not 
as  a  data-Ltiorgus,  but  aa  a  liv- 
ing hjt»toiV  Between  CNNcDtn. 
the  aJt.t8St«!eJsjok88  msws- 
group,  and  th«  hom.?  page  fer 
The  DsUaa  Covi^boy  Chsartoad- 
en.  Kahje  beliavs  he  has  a^ 
turtd  the  Zeitgeist  of  the  Web 
Qjca  1990,  for  histonauB  bo 
pjcdss-  csrttiAxiss  froE  tolsj' 

The  Jogutics  of  taking-  stiap- 
shovsoftheWebarereiaJoveiy 
simple  The  aofVwara  on  KahJe's 
computert  "crawls' the 'Net 
'"^"-=  do-A'rtloading  !!ullio.-.a  of  Web 

p»gw  &Dro  one  aite  after  aooth' 
er  Oiice  a  page  \s  captured,  the  software  looks 
for  rro£E  rrfertnoes  or  Unlo.  to  other  pages  The 
softM'sre  then  raiikes  copies  agnin  and  seeka  ad- 
ilr.-r.|,Ai  linkrcf"TMip>'d  <u  the  nen-  oobts 


BOOKMARKS 


Site  Spotlight 


Who  can  keep  up  with  the  dirf  on  [he  Bold  ond  the  Boob-Jobbed  in  Hollywood'  If  you're  going  to  bite  t 
bullet  and  take  this  stuff  seriously,  it's  hme  to  try  these  entertainment  news  sources. 


The  Hollywood  Reporter 

www.hollywoodreporter.com 


TMRk 


Your  Prime  Sits  lor  EnMrtainment  News  end  It^ornntton 


Yahoo  Entertainment  Summary 


The  Reporter  claims  the  dufiious 
honor  of  being  the  first  in  a  venerable 
line  of  Tinseltown  trade  rags.  You  can 
peek  at  showbiz's  top  headlines,  but 
you  have  to  subscribe  to  get  to  full  sto- 
ries, the  archives  or  film  and  TV  pro- 
duction Ustmgs. 


www.yahoo.com/headiines/entertainment^  .^  'VWElOOE"^-^' 


A  metapage  for  entertainment  infor- 
mation. Yahoo's  daily  entertainment 
news  site  draws  stones  from  People 
magazine,  Vanety,  Reuters  and 
dozens  of  other  sources. 


|;in<l«x  |Naws|WofM|Blz|T«di|Po»a<:  [Snort  |Scorebo«rj|E»t«rtth[H 


Flashers 

>!^ww.flashers.(oni 


fiwis: 


Air  1  F  vf  •»  Fi-  W,r 


•       » 


Dig  up  the  dishiest  dirt  on  your 
favonte  supercheesy  TV  shows 
and  new  movie  premieres,  from 
"Baywatch"  and  "Xena"  to 
"Men  m  Black."  Copious  photos 
of  today's  Hollywood  "talent," 
from  Sherry  Lansing  to  the  babes 
from  "Baywatch." 


BroodcostiiQ  &  UDieOnhe 
www.broadaistingcable.(om 


Find  out  )ust  where  things  stand 
with  digital  TV,  cable  modems, 
telephony  and  rate  regulation  at 
Broadcasting  &  Cable  newsweekly's 
teaser  site.  Our  favorite  quote,  from 
that  semantic  genius  Al  Gore,  who 
is  pushing  to  make  the  ratings  sym- 
bols larger  on  TV's  sauciest  shows: 
"Today,  Amenca's  parents  have 
won  back  their  living  rooms." 


TV  Guide  Entertainment  Network 
www.tvguide.com 

Who  says 

the  Web  is  lust  a  vehicle  for 
the  tony  digital  elite?  Rupert 
Murdoch's  online  TV  Guide 
is  just  as  aggressively  mid- 
brow  as  the  prmt  version, 
and  includes  the  same  tired 
gossip  from  ubercolumnist 
Liz  Smith  and  "news"  on 
your  favonte  idiot-box 
cheesec;il<e  .md  beefcake. 


Bookmarks  is  compiled  bv  Michael  Mattis  and  Tish  Williams 
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Cache  This 

www.turtles.org/happen.htm#logger 

This  site  tracks  a  turtle's  migration  from 
California  to  Japan  in  a  way  that  opens  science 
to  students — an  example  of  sensors  gettmg  mte- 
grated  with  computer  networks  in  interestmg 
ways.  If  this  kind  of  thing  catches  on,  then 
maybe  life  doesn't  have  to  be  a  spectator  sport. 

www.epic.org 

The  Electromc  Privacy  Information  Center's 
aggressive  but  sane  presentation  of  one  of  the 
tnckiest  and  most  important  social  issues  hap- 
pening m  the  technology  sphere. 

pharmdec.wusti.edu/juju/surr/surrealism.html 

The  power  of                                                                   ; 

Web  publishing — 
mexpensive,  highly 
interconnected  and 
equaiizmg — allows 
for  sites  hke  this 
one  on  surrealism. 

Brewster  Kahle 
CEO,  Alexa  Internet 
www.alexa.com 

PreText  interview 


http://www.pretext.com./oct97/columns/intview.htm 
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preText 
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iPRETEXT" 

INTERVIEW 


Brewster  KaMe 

folks  to 
Susan  Dum^. 


We're  ercfiiving 

fhe  link 

sh-ycture  of  the 

whole  Net. 


During  the  last  decade  or  so, 
Brewster  Kahle  has  designed 
supercomputers,  invented 
WAIS  (Wide  Area  Information 
Servers  system),  and  founded 
the  Internet  Archive  (see  The 
Web  Museum).  His  newest 
venture  is  Alexa,  a  piece  of 
navigation  software  that  keeps 
track  of  the  paths  previous 
visitors  followed  from  a  given 
Web  site  and  then  makes 
recommendations  for 
additional  sites  to  visit  based 
on  their  choices.  Available  for 
free  from  the  Alexa  Web  site,  it 
just  may  change  the  way 
information  is  organized  and 
distributed  on  the  Internet. 

From  his  office  near  San 
Francisco's  Golden  Gate  Park, 
Kahle  spoke  by  phone  with 
PreText  writer  Susan  Dumett 
about  Alexa  and  the  future  of 
digital  libraries. 

Susan  Dumett:  How  does 
Alexa  work? 

Brewster  Kahle:  Alexa  is  based 
on  the  collective  wisdom  of  users 
who've  surfed  the  Internet  before 
you.  Alexa  asks,  "Where  did 
other  people  go  after  visiting  this 
site?  Let  them  find  the  quality 
and  weed  out  the  hype."  It's 
different  because  it's  a 
companion-it's  with  you  on  your 
screen  as  a  tool  bar  providing 
information  that  changes  based 
on  where  you  are  and  where  you 
might  want  to  go  next. 

So,  if  I'm  at  an  astronomy  site, 
Alexa  could  refer  me  to  sites 
other  people  have  visited  from 
this  site  that  may  or  may  not 
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Well,  you're  not  going  to  get  a 
sports  site  because  you're  not 
going  to  get  many  people 
reinforcing  that  particular  subject. 
We're  archiving  the  link  structure 
of  the  whole  Net.  We  can  find 
sites  related  to  the  page  you're 
looking  at--not  based  on  the  links 
on  the  page,  but  by  what  other 
people  have  thought  of  that 
page.  It's  kind  of  a  "have  you 
read  any  good  books  lately?" 
form  of  navigation. 


M  Hie  number 
of  sites  grows, 

I'm  not  syre 
we  can  expect 

directories  to 
keep  up. 


There  is  quality 
oyt  there. Ifs 
byried  omorig 

Ions  of 

irrevelent  stuff, 

but  ifs  out 

there. 


Wired  magazine  recently  ran  a 
stiort  piece  questioning 
Alexa's  value  as  people  lose 
interest  in  surfing  the  Net.  But 
Alexa  seems  not  so  much  a 
surfing  tool  as  an 
organizational 
tool-performing  the  role  of 
cyber-librarian  in  a  sense. 

Well,  librarian  isn't  the  most 
exciting  metaphor,  but  it  is  filling 
that  role.  Often  the  value  of  being 
in  a  library  is  not  the  book  you're 
looking  for,  it's  the  other  books 
on  the  shelves.  We're  organizing 
those  shelves.  Alexa's  not  a  surf 
medium,  but  a  medium  for 
finding  things. 

When  you're  over  looking  at  that 
particular  book  on  the  shelf,  we 
show  you  the  other  ones  around 
it--not  just  any  old  books  or  even 
similar  books,  but  ones  that  other 
users  have  thought  were 
important.  The  number  of  Web 
pages  is  now  over  100  million. 
Finding  the  right  information 
requires  pretty  sophisticated 
techniques.  We  think  the 
technique  that's  going  to  work  is 
leveraging  the  intelligence  of  the 
users.  It's  not  like  we  have  the 
smartest  robots  here.  We  think 
people  are  the  smart  part.  We 
help  people  leverage  what 
millions  of  others  have  done. 

What  do  you  mean? 

The  number  of  Web  sites 
doubles  every  six  months.  What 
are  the  current  technologies  for 
finding  things  right  now?  There's 
directories  that  have  editors  type 
in  what  they  think  are  the  best 
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woshlngtonpost:© 


washingtonpost.C^m  l  home  page  I  site  index  I  search  I  help 


Technology  Post:  Your 
plain-English  guide  to  the  digital 
world. 


Web  Exploration:  learn  your  way 
around  the  Web  through  guided 
tours. 

Technology  News  and  Analysis: 
explanations  of  issues  in  the 
digital  realm,  plus  updates  from 
The  Post  and  the  Associated 
Press. 


Alexa:  Unintelligible  Agent 

By  Dan  Pacheco 

Friday,  October  3,  1997;  Page  N70 
The  Washington  Post 

WE  web  users  are  a  fickle  breed.  On  the  one  hand,  we  want 
easy  access  to  any  information,  no  matter  how  obscure  or 
useless.  On  the  other,  we  resent  having  to  dig  around  to 
find  it.  In  an  ideal  world,  the  Internet  would  figure  out 
what  we're  looking  for  before  we  think  of  it. 

Hence  the  rise  of  supposedly  "intelligent"  programs  that 
attempt  to  anticipate  your  next  click.  The  latest  is  Alexa,  a 
Web  copilot  that  follows  you  around  and  recommends  sites 
it  thinks  you'd  like. 

Alexa  makes  its  decisions  in  two  ways.  First,  it  looks  for 
similarities  among  the  site  on  your  screen  and  others  stored 
in  the  Internet  Archive  —  a  compendium  of  (theoretically) 
every  Web  page  in  existence,  compiled  by  its  developers  . 
If,  for  instance,  this  analysis  indicates  most  Web  sites 
about  deer  also  mention  rifles,  Alexa  may  recommend  a 
hunting  site  to  someone  reading  about  whitetail  migration 
patterns. 

The  second  method  is  called  "collaborative  filtering."  As 
people  travel  the  Web  with  Alexa,  their  tracks  are  recorded 
anonymously  in  a  master  archive.  Users  can  also  vote  for 
sites  they  like  and  trash  ones  they  hate,  further  padding  the 
preference  list.  But  since  not  many  people  have  used  Alexa 
yet,  this  feature  isn't  all  there. 

After  a  week-long  trial,  that  was  obvious.  Like  an  annoying 
store  clerk,  Alexa  was  continually  pushing  the  equivalent 
of  ugly  slacks  and  sweaters  when  all  I  really  wanted  was  a 
pair  of  black  socks. 

At  its  best,  Alexa  bumped  me  from  the  University  of 
Colorado's  Web  site  to  schools  in  Hawaii,  Kansas  and 
Illinois.  At  its  worst,  it  made  bizarre  connections  between 
disparate  subjects;  I  still  can't  figure  out  what  Miller 
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Genuine  Draft  has  to  do  with  an  Internet  service  provider 
called  "The  Human  Factor." 

It  only  took  me  a  day  to  realize  that  Alexa's  limited 
universe  of  suggestions  and  categories  paled  in  comparison 
to  any  of  the  hand-picked,  more  populated  categories  of 
Yahoo  or  the  software-generated,  but  still  useful 
LiveTopics  feature  of  Alta  Vista  . 

More  interesting  are  the  details  that  Alexa  can  display 
about  a  site.  If  you're  feeling  nosy,  click  "Where  am  I?"  to 
pop  up  a  tiny  window  with  a  one-to-five-star  rating,  the 
name  and  address  of  the  site  owner,  the  relative  speed  of 
the  site  over  time,  and  the  gross  revenue  of  the  owning 
company  in  1996.  This  can  be  enlightening;  I  had  no  idea 
that  washingtonpost.com,  the  Web  site  I've  worked  on  over 
the  past  year  and  a  half,  contained  38,000  pages. 

Elsewhere  in  Alexa,  a  book  icon  brings  up  a  search  form 
for  Encyclopedia  Britannica  Online  and  the 
Miriam- Webster  Dictionary  .  A  telephone  icon  supposedly 
lets  you  talk  with  other  Alexa  users,  but  in  a  week  I  never 
came  across  one  chatter. 

But  it's  the  smallest  icon,  curiously  tucked  away  at  the 
right-hand  corner  of  the  screen,  that  holds  the  most 
promise  -  and  delivers  the  most  disappointment.  If  you 
ever  come  across  a  missing  page  ("Error  404:  File  Not 
Found"),  clicking  this  icon  will  retrieve  a  copy  from  the 
Internet  Archive  itself.  Unfortunately,  Alexa  only  delivers 
one  randomly-chosen  version,  usually  around  six  months 
old. 

More  useful  would  be  a  feature  that  lets  you  retrieve  every 
version  back  to  a  site's  beginnings  -  something  the 
Archive  should  have  no  problem  doing.  Then  again,  who 
said  there's  any  such  thing  as  a  free  lunch? 

Alexa,  Win  95  (requires  Netscape  Navigator  2.0  and  up  or 
Microsoft  Internet  Explorer  3.0  and  up),  free  at 
http://www.alexa.com 

©  Copyright  1997  The  Washington  Post  Company 
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Net  Surfin  with 
Mike  Wendiand 

Welcome  to  Net  Surfin  with  Mike  Wendiand. 
Here  you'll  find  all  the  sites  that  Mike 
features  on  his  weekly  segment  seen  on 
CNBC's  "Steals  and  Deals"  and  sent  out  to 
all  215  NBC  stations  in  the  U.S.  by  the  NBC 
Newschannel.  An  archive  of  previous  Net 
Surfin'  stories  also  is  available. 


TlPjlljfeU;-;.; 


Check  out  Mike's  high-tech  special  reports.  Latest  story:  Beware  of  Virus  IVIyths. 

Notice!  Mike's  High  Tech  pages  now  use  Flash!  animations  for  its 
top  navigation  bar.  If  you  doi 
download  the  plug-in  today! 


P'.|fc  H^'toJ^^^I  top  navigation  bar.  If  you  don't  see  the  navigator,  make  sure  to 


The  Net  Surfin '  Theme  is  an  original  composition  by  Dan  Bowyer. 


Join  UsI 

Find  answers  to  your  computer  questions,  talk  about  high-tech 
issues  and  join  the  High  Tech  Tall</Net  Surfin  online  connmunity! 
It's  all  at  Mike's^  |MM^tf n  Ppar^  Jn/.  't  today !^  ^ 

Latest  Selection:  Alexa 

It's  called  Alexa  ...  named  after  the  fabled  Library  of  Alexandria  in  ancient 
Egypt ...  the  world's  first  and  last  attempt  to  catalogue  it's  collected 
knowledge.  And  in  that  same  spirit  this  fi"ee,  downloadable  program  tries  to 
do  the  same  thing  on  the  World  Wide  Web. 


@ 


After  you  download  it,  it  lies  at  the  bottom  of  your  screen  whenever  you're  on 

the  Web...  The  Alexa  toolbar  is  ready  to  tell  you  who  owns  the  site  you're  visiting,  other 

similar  sites  you  may  want  to  check  out,  and  how  popular  it  is,  or  how  many  hits  it  receives. 
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There's  also  an  instant  messaging  system  that 

WDIV  TV  ^       Ik      allows  users  to  communicate  with  other  Alexa 

Hits:  Top  half        S       W      people,  too. 


But  my  favorite  feature  is  an  instantly  accessible  link  to  Encyclopedia  Britannica  reference 
tools  ...  like  the  dictionary  and  thesaurus.  You  can  also  get  to  the  online  edition  of  the 
encyclopedia,  but ...  warning  ...  they  charge  for  total 
access,  though  Alexa  users  can  get  a  free  trial 
subscription. 


This  Alexa  program  is  not  a  search  engine.  It's  called  a 
navigation  service  and  what  it  does  is  add  context  to  all 
that  content  out  there,  a  big  help  as  the  Web  continues  to 
double  in  size  every  six  months. 


EftcyclopaerfUi 


Tf»«s»urus 


Do  you  have  a  favorite  site  on  the  World  Wide  Web  that 

you  think  would  interest  our  nationwide  audience?  If  so, 

just  tell  Mike.  If  Mike  uses  your  suggested  site  on-the-air,  he'll  send  you  a  supercool  NET 

SURFIN'  T-shirt.  Don't  forget  to  include  your  name  and  address! 

Got  a  comment  about  computers  or  life  on  the  information  superhighway?  You  can 
contact  me  anytime,  online  at  mike@pcmike.com.. 


Virus  Warnings  -  Watch  Out  for  Hoaxes! 

Heard  the  latest  warning  about  the  destructive  new  computer  virus?  Well,  maybe  you  should 
think  again.  Before  you  panic  at  the  thought  of  a  computer  virus  wiping  out  your  machine  ... 
the  next  time  you  receive  one  of  those  e-mail  "the  sky-is-falling!"  warnings  about  some 
supposedly  "new"  virus  ...  rush,  don't  walk,  to  http://www.kumite.com/myths/.  Most  virus 
scares  are  baseless,  hysterical  and  unfounded. 

Don't  ever  pass  on  a  virus  warning  unless  you  KNOW  it's  real. 

Don't  take  the  e-mail  sender's  word  for  it,  check  it  out  yourself  and  remember,  it's  in  the 
interest  of  the  anti-virus  software  makers  to  get  as  many  people  worried  as  possible. 
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This  Web  navigation  service  works  with  Microsoft 
Internet  Explorer,  Netscape  Navigator,  and 
Netscape  Communicator  browsers  to  suggest  Web 
sites  you  might  want  to  visit  next  based  on  the  site 
you're  currently  checking  out. 
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After  the  search  engines  come  the  syndicators,  companies  looking  for  value 
in  reusing,  promoting  and  linking  sites  and  content  to  drive  value.  Keep  an 
eye  on  this  new  breed  of  Web  content  aggregators. 

Several  interesting  examples  of  the  different  styles  of  syndication  are 
arriving  on  the  Net  this  month,  including  NetEvents  Rapid  Content,  Alexa 
Internet's  archive  and  "where  to  go  next"  features,  PointCast's  and 
Netscape's/Individual's  targeted  vertical  information  channels,  and  the 
evolving  Mining  Company. 

Even  the  Electric  Library  and  Northern  Light,  two  search  engines  that 
incorporate  extensive  archives  of  articles  from  the  paper  press,  fall  into 
this  category,  since  they  try  to  extract  more  revenue  from  reused  content. 

I'll  go  out  on  a  limb  here:  This  category  represents  the  next  source  of 
explosive  growth  on  the  Net,  and  the  path  fraught  with  the  greatest  peril 
for  startups. 

Remember,  the  real  money  in  television  is  made  in  syndication.  Sheldon 
Leonard,  William  Shatner  and  Bob  Denver  made  their  fortunes  on  the 
thousands  of  reruns  of  their  programs.  First -run  shows  seldom  earn  back 
their  production  costs.  Can  the  same  system  work  on  the  Net?  The  answer  is 
a  big  "yes. " 

Syndication  is  the  art  of  picking  the  right  content  for  an  audience  that 
can  be  reached  at  the  lowest  possible  cost  --  and  that  cost  includes 
creative  and  marketing  costs,  not  just  the  raw  expense  of  network 
connectivity. 

On  the  Net,  that  means  the  syndicator  has  to  begin  to  reuse  content  almost 
at  the  moment  it  is  posted. 

Obviously,  this  presents  some  problems,  what  with  copyright  and  other  nasty 
intellectual  property  issues  still  so  up  in  the  air.  But,  because  the  Net 
allows  content  to  be  reused  to  fill  other  sites  without  actually  creating 
copies  of  the  original  material,  there  is  ample  opportunity  to  capitalize 
on  the  floods  of  information  merely  by  directing  it  into  manageable 
channels . 

The  question  is  whether  the  syndicator  has  the  right  model.  Three  basic 
strategies  are  emerging,  which  I'll  examine  in  my  upcoming  columns.  In  a 
nutshell,  the  syndicator  business  models  are: 

1)  Branded  sites  that  deliver  a  collection  of  materials  from  other  sites. 
These  sites  add  value  through  editorial  or  personalization  features  -- 
either  they  sort  through  the  Web  for  content  appropriate  for  their  audience 
or  they  build  sophisticated  back-end  systems  that  filter  information  for 
the  user.  This  group  includes  PointCast  Editions,  Netscape/Individual 
channels,  and  most  news  sites,  including  CNN  and  our  own  dear  ZDNet,  which 
use  third-party  sources  along  with  their  own  content. 

2)  Pass-through  sites  and  servers  that  deliver  aggregated  content  streams 
to  a  third-party  Web  site  that  puts  its  own  brand  on  the  repackaged 
content.  This  is  the  NetEvent  strategy  and  The  Mining  Company's  corporate 
channels  strategy,  the  first  example  of  which  was  with  CitiBank. 

3) .Omnigators,  sites  that  collect  and  personalize  information  from  across 
the  Web  and  can  deliver  services  through  their  own  sites  or  client 
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software,  or  through  third-party  sites.  These  sites  include  Alexa  and  the 
consumer  side  of  The  Mining  Company,  which  uses  human  guides  to  collect  and 
deliver  content  from  across  the  Net. 

Each  of  these  strategies  could  be  successful,  if  the  provider  is  careful 
with  its  resources,  whether  human  or  computational.  The  problem  for  all  is 
growth  and  scalability.  As  we've  seen  throughout  the  history  of  the  Web, 
exponential  growth  can  destroy  the  best  laid  plans. 


THE  NEW  SYNDICATORS,  PART  II 

Having  laid  out  the  foundations  of  the  Net  syndication  market,  let's  take  a 
look  at  the  challengers  to  the  crown.  As  I  said  in  my  last  column,  the  real 
trick  to  syndication  is  combining  a  low  cost  with  a  high  volume  of 
exposures  in  order  to  drive  revenue. 

I've  grouped  the  sites  according  to  the  general  strategies  they  follow. 
Some  are  branding  themselves,  while  others  are  planning  to  charge 
third-parties  to  place  a  brand  on  their  content.  Finally,  one  is  taking  the 
big  leap,  trying  to  sweep  everything  on  the  Web. 

You'll  notice  that  The  Mining  Company  shows  up  under  two  categories.  It's 
the  most  comprehensive  approach  to  the  Web  syndication  business,  and  one  of 
the  strongest;  consequently,  it  also  faces  the  greatest  hazard  of 
syndication  --  can  it  keep  up  with  the  demand  for  more  human  and  business 
relationships  to  keep  its  business  afloat? 

Branded  sites 

*  PointCast  Editions.  The  paragon  of  a  broadcast  channel  on  the  Web, 
PointCast  will  rely  on  third-parties  to  deliver  aggregated  content  that 
deals  with  specific  industrial  categories.  EDS,  a  former  General  Motors 
subsidiary,  for  instance,  will  handle  the  automotive  industry  news.  As  long 
as  PointCast  can  find  partners,  its  costs  will  be  manageable.  And  if  they 
can't  find  partners  to  take  on  a  new  "edition,"  they  are  not  compelled  to 
grow. 

At  375  million  hits  a  month,  the  basic  service  is  a  formidable  foundation 
for  growth  through  segmentation  into  attractive  markets,  which  is  the  very 
soul  of  Web  syndication. 

*  Netscape  Netcaster  vertical  reports.  Like  PointCast,  Netscape  and 
Individual  Inc.  hope  to  attract  a  loyal  audience  from  the  alleged  60 
million  Netscape  Navigator  and  Communicator  users  on  the  Web.  And,  also 
like  PointCast,  the  venture's  ability  to  grow  is  limited  only  by  the 
capacity  of  its  partners  to  bring  content  and  context  together. 

But,  by  turning  the  Netscape  home  page  into  a  fee-based  grazing  pasture  for 
big  content  providers,  Netscape  runs  the  risk  that  it  will  alienate  the 
smaller  content  publisher,  who  they  also  need  as  server  customers.  The 
answer:  a  PointCast  Connections-like  channel  that  gives  anyone  access  to 
the  Netscape  home  page  traffic. 

*  Electric  Library.  One  of  the  best  search  sites  on  the  Web,  Electric 
Library's  real  business  is  selling  library  cards  that  grant  unlimited 
access  to  an  attractive  archive  of  information  from  expensive  journals  and 
photo  sources . 

This  can  be  a  very  lucrative  business,  if  their  collection  continues  to 
grow  and  remains  an  attractive  alternative  to  the  free  stuff  on  the  Web.  At 
$60  a  year  for  a  subscription,  plus  advertising  revenue,  this  is  a 
syndication  move  that  has  legs. 

*  Northern  Light.  The  one-off  sales  version  of  the  Electric  Library, 
Northern  Light  maintains  its  own  extensive  library  of  content  that  you 
usually  can't  find  on  the  Net,  like  Foreign  Policy  and  New  Perspectives 
Quarterly. 
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Northern  Light  will  sell  access  to  individual  articles,  along  with  a 
better-than-average  search  engine  that  sweeps  in  the  contents  of  the  entire 
Net.  With  advertising  revenue  and  a  loyal  following,  which  the  best  search 
engines  have  proven  they  can  build.  Northern  Light  is  perhaps  even  more 
profitable  than  Electric  Library  can  be,  since  a  customer  can  spend  and 
spend  all  year  long. 

*  The  Mining  Company.  The  consumer  side  of  The  Mining  Company  site  brands 
human  guides  who  lead  visitors  to  the  best  of  the  Web.  It's  a  unique 
editorial  play,  but  one  that  is  very  difficult  to  scale. 

However,  because  The  Mining  Company  is  pursuing  several  lines  of  business 
simultaneously,  using  the  same  content  in  different  packages,  it  has  the 
potential  to  create  the  highest  value  from  each  piece  of  information 
syndicated  by  its  guides. 

Pass-through  sites  and  servers 

*  NetEvents  Rapid  Content.  A  pure  syndication  play,  the  NetEvents  team 
would  like  to  be  the  King  World  Syndicate  for  the  Web,  placing  its  content 
in  a  wide  variety  of  sites,  just  as  cartoons  are  distributed  for  use  in 
many  newspapers . 

NetEvents'  customers  are  sites  that  want  to  blend  information  from  many 
sites  to  create  something  that  draw  customers  back  again  and  again.  That 
means  the  company  has  to  find  the  best  of  the  Web,  establish  a  profitable 
relationship  with  the  provider  of  that  information,  and  then  sell 
third-party  sites  on  the  desirability  of  the  content. 

Its  first  relationships  revolve  around  the  well-worn  personal  investment 
information  category,  so  it's  hard  to  tell  whether  NetEvents  really  has  the 
vision  to  open  new  markets  for  information  syndication. 

*  The  Mining  Company.  It's  deal  with  Citicorp,  which  will  populate  banking 
sites  with  content  drawn  from  its  guides  sites,  like  investing  and  small 
business  consulting,  is  another  pure  syndication  play.  At  more  than  $3 
million  for  the  CitiCorp  deal,  this  can  be  a  very  good  business. 

Omnigators 

*  Alexa  Internet.  An  archive  of  the  entire  Net,  with  extensive 
recommendation  and  rating  capabilities,  the  Alexa  service  is  an  odd  hybrid 
of  collaborative  filtering,  buddy  list  communications  and  Web  navigation 
assistance . 

Alexa  attempts  to  offer  users  a  comprehensive,  yet  personalized,  guide  to 
the  Net,  a  very  difficult  trick  to  pull  off.  The  facility  of  the 
client/server  technology  Alexa  uses  is  unproven,  though  founder  Brewster 
Kahle  assures  that  the  system  "learns  over  time  and  gets  better  with  use." 

If  it  works,  it  could  become  a  de  facto  interface  for  mapping  the 
relationship  between  different  pages  on  the  Web.  As  a  syndication  business, 
the  ability  to  provide  context  anywhere  on  the  Web  could  be  the  most 
powerful  differentiator  of  all. 

Next  week,  in  part  three,  I'll  look  at  the  winners  and  potential  losers  in 
the  battle  of  the  syndicators. 

Mitch  Ratcliffe  is  president  of  Internet/Media  Strategies  Inc. 
(www.ratcliffe.com),  a  Tacoma,  Wash.,  consultancy.  He  can  be  E-mailed  at 
godsdogOratcl i  f  f e . com . 

THE  NEW  SYNDICATORS,  PART  III 

There  is  more  than  one  way  to  skin  a  cat,  and  more  than  one  way  to 
syndicate  the  Net.  Several  of  the  companies  we've  been  discussing  in  my 
past  two  columns  will  succeed,  if  they  can  combine  quick  responses  to 
changing  business  situations  with  a  scalable  technology. 
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Human  skills  and  human  resources  will  be  the  deciding  factor  in  the 
syndication  race.  The  technology  for  doling  out  information  is  plenty 
mature,  so  there's  not  a  lot  of  differentiation  to  achieve,  but  in  the 
realm  of  editorial  value  the  opportunity  to  add  value  is  almost  unlimited. 

In  an  environment  filled  with  information,  editorial  and  business 
development  savvy  can  be  applied  to  building  syndication  channels  that 
serve  from  100  to  100  million  people  profitably. 

I  think  The  Mining  Company  has  the  right  basic  idea,  though  I  suspect  it 
has  a  lot  of  work  to  do  on  implementation  to  make  itself  succeed.  The  guide 
concept,  that  a  person  with  interest  and  expertise  in  a  particular  area  is 
the  best  filter  for  that  area,  is  a  great  one. 

Moreover,  the  company  is  distributing  its  human  resource  costs  across  the 
Net.  It  needn't  hire  people,  it  merely  needs  to  convince  potential  guides 
that  it  can  market  and  pay  them  better  than  others. 

But,  the  two  key  problems  at  The  Mining  Company  will  come  as  consumers  and 
business  look  for  contrasting  opinions  --  currently,  the  company  assure 
guides  exclusivity  in  their  areas  of  interest,  and  the  best  guides  will 
recognize  their  increasing  value  and  demand  more  money. 

The  exclusivity  problem  can  be  solved  by  introducing  meta-guides,  who 
assemble  several  guides'  sites  into  a  multiple-perspective  online 
publication;  but  this  costs  money,  up-front  for  editorial  employees.  This 
will  be  crucial  to  the  company's  corporate  services  effort.  How  they  will 
deal  with  guides  making  demands  for  more  money,  or  a  guide  union,  is 
anyone's  guess. 

The  NetEvents  strategy,  to  syndicate  content  through  its  servers  to 
third-party  sites,  is  the  cleanest  and  most  scalable  of  the  companies  in 
this  article.  They  needn't  build  a  lot  of  sites,  just  shovel  the  content 
and  value-added  features,  like  reader  rating  systems,  through  to  the 
third-party  site,  where  it  will  appear  under  another  brand.  The  content  is 
still  served  off  the  original  providers'  sites,  so  NetEvents  will  not  have 
to  expand  its  server  resources  to  support  more  content. 

"This  allows  us  to  scale  quickly  by  focusing  on  very  efficient  distribution 
techniques  rather  than  building  the  mother  of  all  centralized  networks," 
said  Joel  Maske,  CEO  of  NetEvents. 

Making  the  most  of  all  those  connections,  though,  is  a  very  tough  job. 
NetEvents  must  develop  very  refined  packaging  skills,  or  there's  no  reason 
for  content  providers  not  to  compete  against  them  for  hits.  The  company 
will  put  20  percent  of  its  resources  against  finding  and  making  deals  with 
the  Web's  best  content  providers.  Most  of  that  should  go  not  to  Web  surfers 
ala  Yahoo!,  but  for  the  first  generation  of  Web  talent  agents  and 
acquisitions  editors. 

Electric  Library  and  Northern  Light,  search  engines  combined  with  fee-based 
special  collections,  may  find  themselves  hamstrung  by  costs.  In  order  to 
succeed,  they  have  to  overcome  the  popularity  of  other  search  engines. 

As  a  recent  NPD  Group  report  found,  search  engine  users,  particularly  Alta 
Vista  users,  are  very  loyal  and  visit  the  engines  frequently.  For  either 
ELibrary  or  Northern  Light  to  compete,  they  have  to  win  users  through 
better  overall  search  capabilities.  That  means  they  have  to  build  out  their 
systems  to  support  more  traffic,  and  those  extra  people  may  not  necessarily 
pay  enough  to  support  the  added  hardware.  Elibrary  definitely  enjoys  an 
advantage,  since  it  charges  an  up- front  membership  fee. 

PointCast  and  Netcaster,  the  old  hands  in  this  game,  will  confront  both 
scale  and  business  relationship  problems  as  their  syndication  businesses 
grow.  PointCast,  though,  is  far  better  prepared  to  handle  the  argument  that 
they  have  begun  to  compete  against  their  content -providing  customers  on  the 
regular  PointCast  network  --  if  someone  complains  loudly,  PointCast  can 
simply  promote  them  to  a  PointCast  Edition,  one  of  the  syndication 
networks . 

Netscape,  as  I've  said  elsewhere,  will  certainly  alienate  content  providers 
by  pricing  space  on  its  popular  home  page  out  of  many  smaller  companies' 
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reach.  That  may  cost  Netscape  in  the  server  market,  as  its  customers  turn 
to  other  "non-competitive"  servers  (after  all,  who  wants  to  subsidize 
competition  for  readers  by  buying  a  server?  Well,  besides  Microsoft 
customers?) . 

Finally,  Alexa,  the  omnigator  syndication  engine,  is  really  on  to  something 
with  its  catalog-everything-and-map-links-between-sites  strategy.  However, 
in  order  to  ensure  availability  of  data,  it  has  to  store  everything  it 
finds  on  the  Net.  This  means  it  is  creating  copies  of  sites,  which  raises 
all  sorts  of  copyright  flags.  Someone  will  sue  Alexa,  and  soon.  Alexa  has 
technical  workarounds,  but  those  won't  prevent  the  lawyers  from  striking. 
The  company's  success  ultimately  depends  on  its  copyright  strategy  and  its 
lawyers  abilities. 

On  a  scale  of  one  to  ten,  The  Mining  Company  and  NetEvents  earn  a  7.5, 
PointCast  a  7.0,  Electric  Library  a  6.0,  Northern  Light  a  5.0  Alexa  a  4.0 
and  Netscape  Netcaster  a  2.9.  So,  place  your  bets  and  enjoy  the  syndication 
ride . 

Mitch  Ratcliffe  is  president  of  Internet/Media  Strategies  Inc. 
(www.ratcliffe.com),  a  Tacoma,  Wash.,  consultancy.  He  can  be  E-mailed  at 
godsdog@ratclif f e . com. 


•End  forward  message- 


Printed  for  Cynthia  Lohr  <cynthia@alexa.com> 


11/13/1997   11:33 


4158961094 


ANTENNA  GROUP 


PAGE   06 


Canton  Observer 

Canton,  MI 
October  12,  1997 


miitmBumms^J'inanaP 


Information  overload 

World  Wide  Web  keeps  growing  and 


I  don't  know  about  you,  but  I'm 
getting  tired.  The  information 
overload  on  the  Internet  has 
become  simply  unmanageable.  I 
spend  two  hours 
a  day  online  just 
keeping  track  of 
new  sites.  And 
every  night  when 
I  log  off,  there  are 
still  a  couple 
dozen  more  that  I 
don't  have  the 
time  to  click. 

Another  survey 
came  out  this 
past  week  that 
reported         the 

number  of  Web     

sites       on       the 

Internet  is  continuing  to  double 

every  six  months.  Terrific. 

B.ut  I  did  come  across  one  site 
this  past  week  that  offers  relief 
and  convenience. 

It's  called  Ale.xa 

(www  aJe.xa.com)  and  it's  named 
after  the  fabled  and  long  lost 
library  of  Alexandria  in  ancient 
Egypt,  the  first  and  last  attempt 
to  archive  and  catalogfue  the  col' 
lected  wisdom  of  the  world.  In 
that  same  spirit  the  free,  down- 
loadable program  you  can  pick 
up  at  the  Alexa  3ite  tries  to  do 
the  same  thing  on  the  World 
Wide  Web. 


For  more  than  a  year,  the  peo- 
ple at  Alexa  have  been  copying 
every  Web  site  they  come  across. 
As  of  last  month,  they  had  more 
then  seven  terabytes  of  informa- 
tion on  more  than  500.000  sepa- 
rate Web  sites.  A  terabyte  is  a 
million  megabytes. 

Know  how  big  that  is''  Consid- 
er that  all  of  the  books  in  the 
average  branch  of  a  public 
library  contain  about  three  ter- 
abytes of  data. 

That  massive  Internet  archive 
they've  collected  is  used  by  the 
Ale.-«a  program  to  provide  rele- 
vant recommendations  on  where 
to  go  next  on  the  Web.  They  call 
it  "intelligent  navigation." 

The  download  installs  with 


just  a  mouseclick  and  thereafter, 
it  lies  at  the  bottom  of  your 
screen  whenever  you're  on  the 
Web. 

There's  also  an  instant  mes- 
saging system  that  allows  users 
to  communicate  with  other 
Alexa  people,  similar  to  Ameri- 
ca Online's  Buddy  notification 
system  or  ICQ  from  MirabUis 
(www.mirabilis.com). 

My  favorite  Alexa  feature  i6 
an  instantly  accessible  link  to 
Encylopedia  Britannia 

(www. eb  com)  reference  tools, 
like  the  dictionary  and  the- 
saurus. You  can  also  get  to  the 
online  edition  of  the  encylope- 
dia but...  warning...  they  charge 
for  total  access,  though  Alexa 
users  are  offered  a  free  trial  sub- 
3criptif>n. 

Ale.xa  only  works  with  Win- 
dows 95.  Macintosh  and  Win  3.1 
versions  won't  be  available  until 
mid.1998. 
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Case  Study 


The  Daumm  Task 
Of  STORmG  The  Web 


BY  CHUCK  MOOZAKIS 


TIL\CKING  THE  CONTENTS 
of  the  World  Wide  Web  is  one 
thing  Storing  the  contents  15 
quite  another 

But  storing  the  World  Wide 
Web  is  exactly  m  hat  Ale\a  Internet 
does  each  and  e\  ery  day 

That's  part  of  the  purpose  of 
Alexa,  a  fret  Web  navigation  ser- 
vice launched  late  last  month 
AJexa  also  trjcks  the  sites  Internet 
users  visit  on  the  Web  and,  takin'/ 
a  page  from  Consumer  Reports, 
grades  the  value  of  a  site  bjsed  on 
a  series  of  customized  benchmarks 
and  other  metrics  that  vary  de- 
StOSHS.  I>«1(  3S  >• 
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Alexa 

(Alexa  Internet) 

Free  Web  Navigation  Software 

Why  wander  the  Web  aimlessly 
when  you  can  use  a  guide?  Get 
Alexa  and  always  know  where  to 
go  on  the  Web. 

Alexa  is  a  toolbar  application  that 
runs  with  your  browser  (including 
Netscape  browsers  and  Microsoft 
Internet  Explorer)  to  provide 
information  on  sites  that  you  visit 
as  well  as  suggested  links  on  where 
to  go  next.  These  recommendations 
are  based  on  the  surfing  paths  of 
other  Alexans,  so  you  have  tens  of 
thousands  of  people  helping  you 
find  the  best  sites  on  the  Web. 
Alexa  is  the  first  anonymous, 
multisurfing,  Web  site  suggestion 
tool  and  conferencing  application. 

Alexa  also  lets  you  send  messages 
to  other  Alexans  via  its  Instant 
Messaging  service.  It  also  has  a 
desktop  reference  feature  that 
always  allows  you  to  look  at 
something  else  without  leaving  the 
page  you're  on.  You  can  even 
review  old  Web  sites  (especially 
helpful  when  you  get  a  "404  not 
found"  error  message)  through  the 
Alexa  Archive. 

Why  bump  around  without  a 
guide?  Get  Alexa  and  know  where 
to  go. 

Please  visit  our 
website:  www.alexa.com 
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Text  and  photos  by  Tish  Williams 


We  thought  we'd  lost  Brewster 
Kahle  under  a  mountain  of 
Is  and  Os  in  his  crusade  to 
archive  the  entire  Web.  Miraculously, 
Kahle  broke  free,  surfacing  unsquashed 
on  July  17  to  launch  a  brand-new  com- 
pany, Alexa  Internet. 

Alexa  offers  a  new  type  of  browser 
that  provides  a  s>Tiopsis  of  each  site 
you  happen  upon,  lets  you  know  who 
owns  the  site,  tells  you  how  popular 
that  site  is  based  on  other  visitors' 
time  there  and  recommends  alterna- 
tive sites  of  the  same  genre  that  are 
the  most  trafficked.  Keeping  Kahle's 
pet  project  on  top  of  the  Web,  Alexa 
will  share  the  fruits  of  its  labor  with 
the  Internet  Archive. 

The  martini-bolstered  crowd  re- 
sponded enthusiasticaUy  when  Kahle 
showed  how  Alexa  draws  from  the 
archive  during  searches:  If  users  try  to 
cUck  through  to  an  item  during  thefr 
search  and  find  that  the  page  no  longer 
exists  on  its  server — restilting  in  the 
dreaded  404- Not  Foimd  error — Alexa 
displays  a  chckable  icon  that  accesses 
an  archived  version  of  the  lost  page. 
"These  are  just  some  of  the  fun  things 
you  can  do  if  you  have  five  T3s  in 
your  backyard,"  Kahle  said. 

Imagine  a  world  without  404- 
Not  Founds.  Now  there's  paradise. 
Or  an  office  with  five  T3s  ...  now 
that's  paradise. 
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Your  Complete  Guide  to  Searching  the  Net 

The  Future  of  Search 


These  pages  offer  a  good  guide  to  searching,  but  it's  apparent  that  such 
tools,  no  matter  how  advanced,  still  can't  keep  up  with  the  Internet.  The 
amount  of  data,  the  rapidity  of  change,  and  the  hectic  web  of  links  pulling  it 
together  make  truly  coherent,  comprehensive  organization  nearly  impossible. 
People  differ  on  where  the  problem  lies-user  interface,  data  collection  and 
analysis,  speed  limitations  of  hardware,  and  so  forth--but  clearly,  we  need  a 
new  solution.  Here  is  a  sampling  of  new  approaches  from  the  information 
industry. 

Mapping  Companies  such  as  Perspecta  (wv^aa/. perspecta.com)  and  Semio 
Corp.  (vwvw.semio.com)  offer  Java-based  products  that  analyze  and  organize 
documents  by  concept  and  attribute.  The  products  then  respond  to  mouse-  or 
text-based  input  with  dynamically  generated  visual,  navigable  maps  of 
relationships  and  hierarchies. 

Collaborative  Filtering  Alexa  Internet's  (vwAA/.alexa.com)  free  downloadable 
toolbar  stays  on  your  desktop  while  you  surf  and  provides  statistics  and 
owner  information  for  the  sites  you  visit.  Using  data  on  where  other  users 
who  have  visited  a  site  have  gone  as  well  as  link  and  text  analyses  of  the 
site,  Alexa  dynamically  suggests  other  links. 

Specialization  NewsBot  (www.newsbot.com).  from  HotBot,  is  a  standalone 
ActiveX  control  that  you  access  from  your  desktop.  You  can  search  the 
NewsBot  database  of  top  news  sites  using  an  interface  similar  to  a  Web 
search  screen. 


Client-Side  Metasearching  Prompt  Software's  WebSleuth 
(wvvw.promptsoftware.com)  is  another  in  the  vast  array  of  client-side  search 
tools.  WebSleuth  lets  you  query  an  unlimited  number  of  search  sites 
simultaneously;  the  program  analyzes  returns  and  drops  unrelated  and 
broken  links  before  giving  you  your  results. 

Personal  Agents  Inquisit  (www.inquisit.com)  targets  business  professionals 
with  a  subscription-based  "personal  intelligence  service."  Users  set  up  agents 
with  ongoing  queries;  the  agents  monitor  Inquisit's  database  of  news  and 
information  services  and  send  e-mail  updates  at  times  specified  by  the  user. 

Human  Contribution  Sites  like  LookSmart  and  Yahoo!  use  people  to 
analyze  and  categorize  the  sites  in  their  databases.  The  Mining  Company 
(www.miningco.com)  takes  this  a  step  further  with  its  cooperative  of  sorts. 
Users  apply  to  be  Guides  for  subsites  on  specific  topics.  Guides  are 
responsible  for  the  focus  of  their  sites  and  for  updating  links  and  adding  new 
ones. 
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As  the  inventor  of  the  WAIS  (Wide  Area  Information 
Servers)  system  and  founder  of  WAIS  Inc.,  Brewster 
Kahle  pioneered  publishing  on  the  Net.  Kahle's  latest 
-.  oiect,  Alexa,  is  an  Internet  archive  -  a  time  capsule 
or  sorts  -  in  an  effort  to  preserve  everything  on  the 
Net  as  history  in  the  making. 

CMQ:  Why  do  we  need  to  preserve  a  snapshot  of 
the  World  Wide  Web? 

BR:  The  real  opportunity  in  archiving  and  datamining 
the  WWW  is  to  produce  a  wonderful  navigation  service 
that  gets  better  as  it  grows  in  content  and  users. 

Archiving  the  Net  is  a  necessary  first  step,  but  then 
making  it  useful  to  everyone  is  what  the  Alexa  project 
is  all  about.  Named  after  the  Library  of  Alexandria, 
.\lexa  is  looking  to  build  a  worldwide  resource  for  those 
that  use  the  WWW. 

Alexa  is  a  free  service  and  is  available  from 
wvvw.alexa.com.  It  will  improve  as  more  people  use  it 
because  it  learns  from  people.  In  many  ways,  we  are 
building  a  huge  digital  library  that  becomes  smarter  as 
'he  Net  grows  and  people  use  the  service. 

What  progress  have  you  made  to  date? 

We  have  collected  a  full  snapshot  of  the  public  Web, 
including  pictures  and  video  clips.  Furthermore,  we 
have  kept  collecting  the  changes.  We  have  seven 
terabytes  of  data  now,  and  it  is  growing  at  one  terabyte 
a  month. 

To  put  this  in  perspective,  all  the  text  in  the  US 
Library  of  Congress  is  20  terabytes.  (A  terabyte  is  a 
niillion  megabytes,  and  a  book  is  about  one  megab\1e 
of  text.) 

The  data  has  many  uses  and  we  are  using  it  in  many 
interesting  ways  including: 

•  Finding  patterns  in  the  Web  to  build  a  navigation 
service  that  learns  from  people.  This  is  the  core  of 
Alexa's  service.  A  lot  can  be  learnt  from  "datamining" 
the  link  structure  and  aggregate  usage  patterns  to 
.:uide  future  users  of  the  Web.  Search  engines  and 
directories  don't  scale  up  well  as  the  Web  grows. 

•  Tracking  changes  in  the  Web  and  its  use  to  find  areas 
of  growing  and  shrinking  interest:  What's  hot?  What's 
up-and-coming? 

•  Offering  historians  and  scholars  an  unprecedented 
collection  of  human  voices. 

•  Keeping  a  record  of  the  birth  of  a  new  medium,  and 
the  dreams  people  have  for  it. 

What  guidelines  would  differentiate  a  digital 
library  from  current  directories? 

Alexa  is  different  from  current  directories  and  subject 
lisfings  in  the  following  ways: 


•  Alexa  is  a  navigation  service  that  uses  the  link 
structure  of  the  Net,  content  of  the  Web  pages,  and  the 
aggregated  usage  paths  to  suggest  Web  pages  and  sites 
that  might  be  interesting  to  a  user. 

•  .Alexa  also  keeps  copies  of  what  was  on  the  Web  in  the 
past  to  serve  out-of-print  Web  pages. 

•  Alexa  is  not  a  keyword  search  system;  it  uses  the 
paths  people  have  made  through  the  Web  to  suggest 
similar  Web  sites. 

•  Alexa  is  not  a  directory  (like  Yahoo)  where  a  few 
editors  are  creating  subject  listings.  It  uses  what  many 
people  have  done  to  reinforce  clusters  tliat  make  sense. 

Who  would  administer  a  global  digital  library? 

Everyone.  This  is  our  Web.  Everyone  has  a  role  to  play 
in  reinforcing  the  paths  to  the  quality  materials.  Alexa 
serves  as  a  conduit  and  a  repository  for  usage  paths  and 
content  to  be  used  by  everyone.  We  expect  every 
company,  government,  and  university  to  build  systems 
that  are  similar  in  many  ways  to  what  Alexa  is  building. 

What  criteria  would  determine  the  kind  of  content 
admitted? 

We  gather  all  publicly  accessible  Web  pages  and  revisit 
them  every  six  weeks.  We  do  not  gather  materials  that 
people  have  indicated  they  don't  want  gathered  by 
using  the  "robot  exclusion  standard"  of  the  Net. 


How  would  changing  file  formats  ensure  the 
usability  of  digital  data  over  a  period  of  decades? 

Many  obscure  formats  will  not  be  readable,  but  the 
more  popular  ones  will  be  because  people  want  to  see 
them  again.  We  are  trying  to  help  by  keeping  a  large 
collection  of  materials  accessible  so  that  it  is  worth 
writing  converters. 

Can  digital  archives  spawn  other  technologies  and 
services? 

Imagine  being  able  to  see  a  video  of  your  great  grand- 
father. Imagine  not  panicking  when  your  computer  disk 
crashes.  Imagine  being  able  to  do  research  on  mountains 
of  newspapers  in  a  flash. 

We  are  only  starting  to  understand  what  can  happen 
when  computers  and  global  knowledge  get  intermingled. 

How  will  digital  libraries  seek  to  answer  important 
questions  about  our  culture? 

As  our  cultural  artifacts  go  digital  (music,  writings, 
scholarship,  letters, ...),  then  the  digital  libraries  will  be 
a  place  where  these  are  organised  and  sei-ved  from. 

We  are  looking  for  ideas  and  interested  people  to 
use  the  system  and  to  suggest  where  it  should  go.  « 
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General  Editorial  Liaison: 

Over  the  November-December  period,  the  following  journalists  were  also  contacted 
regarding  general  Alexa  inquiries,  new  angle  pitches,  feature  article  opportunities  and 
follow  up.  These  journalists  have  responded  by  either  writing  about  Alexa  or  expressing 
interest  in  covering  Alexa  in  the  near  future: 


Sari  Kalin 
CIO  Magazine 

Simson  Garfinkel 
Freelance 

Gene  Kaprowski 
Forbes  ASAP 


Howard  Millman 
InfoWorld 

Tim  Jackson 

London  Financial  Times 

Elizabeth  Wasserman 
San  Jose  Mercury  News 


David  Lidsky 
PC  Magazine 
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The  7-Terabyte  Man  Takes  On  The  Storage  Gods 


BREWSTER  KAHLE  HAS  CHALLENGED  THE  STOR- 
age  gods.  Kahle's  company,  start-up  Ale.xa  Internet,  has 
downloaded  and  stored  no  less  than  the  entire  contents  of  the 
public  Internet,  a  staggering  7  terabytes  and  counting,  as  the  basis  for 
its  new  information  business. 

Kahle  has  developed  a  new  Internet  search  engine/gopher  tech- 
nology that  promises  to  deliver  more  reliable  Web  searches.  His  goal; 
Be  the  site  with  the  most  complete  database  so  Web  surfers  always  get 
the  information  they're  looking  for.  No  more  "404  Error:  URL  Not 
Found"  hits  that  pop  up  all  too  often.  Kahle  hopes  revenue  will  be 
generated  by  advertising,  and  the  search  engine  will  be  free  to  surfers, 
much  like  Yahoo!  and  Lycos,  or  any  other  search  engine. 

Of  course,  the  37-year-old  president  of  Alexa  Internet  admits  that 
few  companies  have  such  massive  storage  needs  or  goals.  Neverthe- 
less, he  has  seen  the  future  and  warns  that  the  Internet  will  quickly 
have  an  impact  on  their  re- 
quirements. "Fortune  500 
companies  now  have  the 
number  of  nodes  on  their  in- 
tranets that  the  Internet  had 
five  years  ago,  and  it's  grow- 
ing all  the  time,"  says  Kahle. 

High  Performance 

Kahle's  storage  M.O.  is  to 
buy  as  many  high-speed  disks 
as  he  can  afford,  which  he 
claims  access  data  at  9 
megabytes  per  second,  and 
then  use  DLT  tape  for  the 
rest. 

DLT  costs  about  SIO  to 
SI2  per  gigabyte  mstead  of 
the  SlOO  per  gigabyte  price 
for  hard  disk.  Of  course,  tape 

performance  varies.  If  you're  accessing  data  that's  on  a  tape  where  the 
robotic  arm  is  already  residing,  Kahle  says  you'll  get  a  retrieve  time 
of  about  3  megs  per  second.  If  you  have  to  change  tapes,  though,  it 
can  take  as  long  as  four  minutes  for  the  same  amount  ot  data. 

"I  use  the  tape  to  store  data  on  Internet  sites  that  people  don't 
visit  very  often — like  DOS  community  discussion  sites,"  says 
Kahle.  "For  the  really  popular  site  data,  such  as  search  engines  like 
Yahoo!,  the  information  has  to  be  on  our  hard  disks.  People  can 
surf  to  our  Web  site  and  test  our  new  gopher.  If  it  takes  people  up 
to  four  minutes  to  find  Yahoo!  on  our  system,  they'll  go  away  and 
never  come  back." 

Right  now  1  terabyte  of  Kahle's  data  is  stored  on  high-speed  disks 
from  Quantum  Corp.  and  Seagate.  He  chose  Qiiantum  fust  because 
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it  used  to  have  the  biggest  hard  disks  Kahle  could  find  on  the  mar- 
ket— 9  gigs.  "But  now  we  have  these  brand-new  23-gig  Seagate 
dri\es,  and  they're  working  well,"  he  says.  "And  I'm  a  good  test  if 
something  works  well  because  we  are  so  hard  on  the  technology — 
constantly  accessing.  I  think  both  of  these  disk  drives  are 
bulletproof" 

Kahle  sa_\'s  the  drives  are  built  well  because  they're  intended  for 
the  PC  and  video  industries.  "Even  if  you  don't  go  with  Quantum  or 
Seagate,  go  with  a  brand  catering  to  these  same  industries  and  you'll 
have  a  fairly  safe  bet,"  he  sa\s. 

He  also  buys  Quantum's  DLT  tape  technology,  and  then  uses  hi- 
erarchical storage  management  software  from  Large  Scale  Configu- 
ration Inc.  (www.lsci.com).  The  LSC  software  determines  what 
should  be  stored  on  tape  and  what  should  be  on  hard  disk  for  fast  and 
constant  access.  Kahle  says  LSC  comes  from  the  mainframe  world 

and  understands  the  issues 
surrounding  converting  a 
centralized  storage  system 
into  a  cost-effective 
multitiered  one. 


Full-Time  Service 

"What  the  Internet  brings  to 
the  table  for  us — and  many 
other  companies  with  both 
Internet  and  intranet  Web 
storefronts- — is  that  we're 
open  to  provide  customer 
ser\ice  and  information  24 
hours  a  day,  se\'en  days  a 
week,"  Kahle  says.  "That 
means  access  to  stored  data 
has  to  be  absolutely  opti- 
mized for  what  people  want, 
balanced  with  what  vou  can 
afford." 

Smart  storage  management  is  critical  to  Kahle,  who  says  if  the 
data  now  becoming  available  on  corporate  intranets  is  to  be  of  real 
use,  it  must  be  stored  and  accessed  in  a  meaningful  way. 

For  Kahle,  real  use  means  people  in  organizations  can  learn  from  one 
another  to  share  knowledge  across  departments.  And  Kahle  insists  that 
type  of  storage  management  won't  break  your  budget. 

"Tr\  what  we're  doing  on  the  Internet  on  your  intranet — I 
promise  it  won't  cost  a  million  dollars,"  says  Kahle.  "Go  for  the 
SlOO-per-gig  hard  disks  where  you  can,  and  in  many  cases  you  may 
be  able  to  store  the  whole  thing  on  hard  disk,  at  least  for  now.  The  av- 
erage Web  site  on  an  intranet  or  the  Internet  is  measured  in 
niegab\tes,  so  e\en  100  gigs  worth  of  storage  is  SIO,000."  • 
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Archive  holds  wealth  of  data 


America  is 
full  of  entre- 
preneurs who 
spend  their 
careers  mak- 
ing money 
and  their 
retirements 
disbursing  it 
to  good  causes.  Cases  where 
the  business  comes  after  the 
philanthropy  are  rarer,  but 
that  is  the  story  of  Alexa 
Internet  (www.alexa.com),  a 
company  that  has  developed 
a  useful  tool  for  navigating 
the  web. 

Alexa  is  the  creation  of 
Brewster  Kahle,  who  worked 
on  fast  computers  at  a  com- 
pany called  Thinking 
Machines.  He  founded  a 
company  called  WAIS,  which 
he  sold  to  AOL  in  1995  for 
$15m  (£8.8m). 

Too  young  to  retire,  Mr 
Kahle  started  musing  on  the 
fact  that  the  web  is  tran- 
sient, because  no  trace 
remains  of  a  web  page  once 
its  publisher  has  removed  it. 
With  his  own  money,  he 
built  the  Internet  Archive 
(www.archive.org),  which 
now  contains  eight  terabytes 
of  data.  Since  a  terabyte  is 
1,024  megabytes,  the 
archive's  holdings  are  equiv- 
alent to  about  16m 
full-length  books. 
The   data    are   stored    on 


tapes  in  a  juke-box  system, 
which  allows  any  one  of  the 
world's  135m  web  pages  to 
be  retrieved  in  about  15  min- 
utes. The  archive  is  updated 
by  software  spiders,  which 
crawl  around  the  web 
looking  for  new  pages  and 
adding  them  to  the  archive. 

While  spending  $100,000  of 
his  own  money  building  this 
system,  Mr  Kahle  realised 
the  benefits  of  developing  a 
navigation  tool  to  help 
people  find  what  was  most 
useful  on  the  web.  He  chose 
the  name  Alexa  in  homage 
to  the  great  library  at  Alex- 
andria, one  of  the  wonders  of 
the  ancient  world. 

Alexa  is  a  software 
"agent"  in  the  form  of  a 
small  toolbar  beneath  the 
browser  window.  The  pro- 
gram sends  information  on 
your  behaviour  to  Alexa 
which  can  be  used  to  benefit 
others. 

The  toolbar  keeps  track  of 
which  page  is  being  dis- 
played in  the  web  browser 
window  and  offers  informa- 
tion related  to  that  page.  By 
clicking  one  button  on  the 
Alexa  toolbar,  you  can 
answer  the  question  "Wliere 
am  I'.''".  A  window  appears 
showing  the  popularity  of 
the  site,  details  of  the  web 
site's  owner  and  statistics  on 
the  site's  updates. 


The  toolbar  also  suggests 
where  you  should  go  next, 
using  a  number  of  methods. 
One  is  to  track  usage  paths. 
The  program  keeps  track  of 
the  sites  you  visit  in  each 
session  and  how  long  you 
spend  there,  and  relays  this 
information  to  Alexa.  The 
software  gives  greater 
weight  to  pages  that  are 
viewed  for  longer,  on  the 
plausible  grounds  that  they 
are  likely  to  be  more  useful. 

Another  approach  is  link 
analysis.  The  Alexa  spiders 
find  sites  that  recommend 
the  page  you  are  looking  at. 
and  considers  other  sites  to 
which  they  link  as  possibly 
related.  A  third  approach  is 
content  analysis. 

Using  these  methods,  the 
Alexa  toolbar  creates  four 
links  to  sites  that  may  be 
related  to  the  current  page. 

Even  though  the  Alexa  cli- 
ent activates  six  processes 
on  the  company's  server 
every  time  the  browser 
shows  a  new  page,  Mr  Kahle 
says  the  computing  power 
needed  for  the  continuous 
updates  is  modest. 

Mr  Kahle  hopes  to  turn 
this  into  a  lucrative  advertis- 
ing vehicle.  From  the  compa- 
ny's point  of  view,  its  key 
advantage  is  that  the  toolbar 
stays  open  for  hours  as  users 
surf  the  web,  allowing  Alexa 


to  sell  far  more  ad  "impres- 
sions" than  even  the  most 
popular  web  site. 

Alexa  can  "slice  and  dice" 
its  customer  base  because  it 
asks  users  for  demographic 
data  such  as  residence,  age, 
job  and  sex.  Advertisers  can 
buy  toolbar  ads  that  are 
shown  only  when  the  user  is 
looking  at  a  particular  web 
site.  This  lets  Alexa  sell  site- 
specific  ads  without  the  site 
owner's  involvement. 

So  far,  Mr  Kahle  and  his 
colleagues  have  not  sold  a  ' 
single    ad.    They    want    to 
avoid  putting  users  off  by  ' 
making  the  ads  too  obtrusive , 
at  the  beginning. 

But  there  is  another  rea-  ^ 
son  for  their  hesitation.  Two  ! 
months  since  the  product's  ) 
launch,    the    software    has 
been  downloaded  by  75,000 
people.  Microsoft  and  Net- 
scape often  achieve  similar, 
download  numbers  in  a  mat-  ■ 
ter  of  days. 

Mr  Kahle  says  his  down-' 
load  target  is  Im  copies  by 
the  end  of  1998. 

If  Alexa  cannot  achieve 
enough  users,  others  will 
imitate  or  incorporate  its 
best  ideas.  The  company 
needs  some  marketing  tools 
to  encourage  other  web  sites 
to  promote  its  product. 

tim.jacksoniwpobox.com        ./- 
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Rewevv  ofAlexa  Internet  Software 

)      These  are  my  impressions  ofAlexa  version  1.2.  If  you  have  an  earlier  version,  you  should 
upgrade  to  version  1.2.  If  you  have  version  1.2,  future  updates  should  be  automatic,  according 
to  Alexa.  I  am  assuming  that  you  have  already  read  the  overview,  quick  tour  and  FAQ  on  the 
Alexa  web  site.  They  are  well-written  and  will  answer  many  questions  about  Alexa's  features.  If 
you  are  the  sort  of  person  who  usually  ignores  documentation,  you  should  make  an  exception 
in  Alexa's  case.  A  collection  of  articles  about  Alexa  is  available. 

I  downloaded  and  installed  Alexa  with  no  problems. 

Alexa  worked  with  Netscape  3  and  4  and  Microsoft  Internet  Explorer  3  and  4. 

Alexa  places  an  icon  (a  lower-case  "a"  in  a  blue  circle)  in  your  taskbar's  tray~the  tray  is  on  the 
far  right  side  of  the  taskbar,  next  to  the  display  of  the  current  time.  If  you  have  closed  Alexa 
and  want  to  re-open  it,  click  on  the  icon. 

The  Alexa  toolbar  can  be  dragged  around  the  screen~l  prefer  putting  it  between  the  bottom  of 
the  browser  window  and  the  Windows  95  task  bar.  There  is  an  Autohide  option  that  will  keep 
the  toolbar  hidden  until  you  want  it,  thereby  giving  your  browser  window  more  room  on  the 
screen.  If  you  use  large  buttons  in  your  browser  toolbar  (I  use  small  text-only  buttons  in 
Netscape)  and  you  use  640x480  screen  resolution,  you  may  find  that  you  must  use 
Autohide-otherwise,  if  you  have  the  Alexa  toolbar  displayed  along  with  all  the  other  bars  on 
the  page  and  you  are  viewing  a  web  page  with  frames,  you  may  have  little  room  left  in  your 
browser  window! 

Help  is  launched  by  clicking  on  the  extremely  tiny  question  mark  at  the  lower  right  side  of  the 
Alexa  toolbar.  Help  can  also  be  launched  by  clicking  on  the  lower  case  "a"  on  the  left  side  of 
the  Alexa  toolbar,  which  reveals  a  menu  that  includes  help,  technical  support  Q&A  and  other 
options. 

The  "Where  You  Are"  menu  gave  me  a  lot  of  information  that  1  usually  didn't  need  to  know. 
Maybe  others  will  find  it  to  be  more  valuable.  The  "View  Site  Map"  option  on  the  "Where  You 
Are"  menu  is  useful  if  you  want  a  "road  map"  of  what  a  site  looks  like;  however,  it  doesn't  give 
you  every  page  on  a  server-it  only  gives  you  some  of  the  pages  that  branch  off  from  a 
domain's  main  page.  Alexa  uses  the  SiteMap  site  to  provide  site  maps;  SiteMap  has  some 
interesting  features. 

I  wonder  how  many  Alexa  users  will  bother  to  use  the  "Vote"  button.  Unless  people  use  it,  the 
like/dislike  ratings  won't  be  meaningful.  A  drawback  of  the  "Vote"  feature  is  that  the  votes  will 
be  irrelevant  in  some  cases.  If  someone  goes  to  Geocities  and  doesn't  like  a  member's  page, 
it  will  count  as  a  "dislike"  vote  for  Geocities  as  a  whole,  but  that  vote  has  no  relevance  to  any 
of  the  other  Geocities  member  pages. 

Alexa  has  small  ads  on  the  toolbar. 

The  "Where  To  Go  Next"  list  has  up  to  four  choices  on  the  toolbar-clicking  the  right-pointing 
)      circled  arrow  just  to  the  right  of  the  "vote"  button  gives  more  choices.  "Where  To  Go  Next" 
usually  gave  useful  suggestions,  but  occasionally  the  suggestions  seemed  worthless  or 
inappropriate.  For  example,  when  I  went  to  a  page  on  the  www.sierra.net  server,  Alexa 
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suggested  a  bunch  of  links  to  Sierra  Leone.  Alexa's  suggestions  will  get  better  as  more  people 
use  Alexa  and  Alexa  gathers  more  data. 

The  archive  icon  (it  looks  like  a  pillared  building)  turns  into  a  little  piece  of  paper  if  you  go  to  a 
web  page  for  which  Alexa  has  an  archived  copy.  If  you  go  to  a  page  that  gives  you  a  "Not 
Found"  message,  such  as  "File  Not  Found",  "404  Not  Found"  or  "HTTP/1.0  404  Object  Not 
Found",  wait  a  couple  of  seconds  to  see  whether  the  little  piece  of  paper  appears.  If  it  does, 
you  can  click  on  the  archive  icon  and  Alexa  will  fetch  its  archived  version  of  the  page.  A  page 
will  appear  that  says: 

Your  page  is  being 
fetched  from  our  Archive. 

You  will  be  notified  by  Alexa 
when  it  is  ready  to  be  viewed. 

Because  the  data  must  be  fetched  from  our  tape 
robot,  this  process  takes  time.  The  normal  wait  for  a 
page  and  all  of  its  images  is  about  15  minutes,  but 
complicated  pages  may  take  considerably  longer. 

You  may  continue  with  your  normal  use  of  your 
computer  while  the  page  is  being  retrieved,  but  please 
leave  Alexa  running  so  that  it  can  notify  you  when  your 
page  is  ready. 

When  Alexa  has  retrieved  an  archived  page,  a  window  appears  on  your  screen  asking  you 
whether  you  want  to  look  at  the  archived  page. 

f  To  test  the  Alexa  archive,  I  looked  through  my  bookmarks  for  "Not  Found"  pages  and  I  also 

used  AltaVista's  advanced  search  function  to  find  other  peoples'  bookmark  pages  that  hadn't 
been  updated  since  early  1996.  Most  of  the  "Not  Found"  pages  I  encountered  were  in  the 
archive.  This  may  be  partly  due  to  the  fact  that  some  of  those  "Not  Found"  pages  were 
personal  pages  that  were  probably  not  on  the  web  for  very  long  or  were  seldom  visited  while 
they  were  on  the  web.  If  you  are  looking  for  a  "Not  Found"  page  that  was  once  popular,  it  is 
probably  more  likely  that  it  will  be  in  Alexa's  archive. 

An  archived  page  retrieved  by  Alexa  doesn't  have  the  same  URL  as  the  "Not  Found"  page.  For 
example,  when  I  asked  Alexa  for  the  follwing  "Not  Found"  page: 

http://www.hyperion.com/~koreth/particles/ 

the  URL  of  the  archived  page  was 

http://widener.alexa.com/http/www.hyperion.com/80/~koreth/particles/jajndex.html 

A  day  later,  I  was  able  to  retrieve  the  above  page  immediately  by  using  the  longer  URL.  This 
was  not  because  my  browser  had  cached  the  page.  It  was  undoubtedly  available  because 
Alexa  had  cached  it.  An  Alexa  technical  support  person  told  me  that  the  Alexa  cache  holds 
many  gigabytes  of  frequently  requested  data. 

Alexa  gives  you  the  most  recently  archived  version  of  an  archived  page.  This  is  not  always 
useful,  because  conscientious  webmasters  often  put  up  a  placeholder  before  they  remove  a 
page  completely.  For  example,  Alexa  retrieved  one  archived  page  that  had  only  the  following 
text: 
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This  link  is  gone! 
Please  remove  bookmarks  to  it 


If  Alexa  could  give  me  an  earlier  archived  version  of  that  page,  it  would  have  been  more  useful 
to  me.  (On  the  other  hand,  if  the  webmaster  put  up  a  link  to  the  new  location  before  the  page 
was  taken  down,  the  archived  page  would  link  to  its  new  location.) 

As  another  test,  I  tried  to  find  an  old  Simpsons  page.  The  people  behind  the  TV  show  "The 
Simpsons"  have  threatened  legal  action  against  some  people  who  created  Simpsons  pages.  I 
found  a  "Not  Found"  Simpsons  page  that  appeared  to  be  archived,  but  I  got  the  following 
message  from  Alexa: 

The  object  you  requested  is  not  in  the  Archive. 
We  will  not  be  able  to  fulfill  your 
request . 

The  probable  cause  for  this  error  is  that  the 
authors  of  the  page  requested  its  removal 
from  the  Archive. 

Such  requests  are  honored  immediately,  but 
there  is  a  delay  before  all  databases  are 
updated  to  reflect  the  deletion.  So  Alexa  will 
continue  to  indicate  the  availability  of  the 
page,  for  some  time  after  its  removal. 

In  a  framed  site,  Alexa  won't  tell  you  whether  a  particular  frame  is  archived  unless  you  open 
the  frame  as  a  separate  page. 

It  is  not  possible  to  retrieve  an  archived  page  that  is  from  a  domain  that  no  longer  exists,  such 
as  the  once-popular  www.gnn.com.  You  can't  get  a  "Not  Found"  message  because  the  server 
itself  isn't  connected  to  the  Internet  any  more.  An  Alexa  technical  support  person  said  they  are 
working  on  a  method  to  permit  users  to  access  a  page  from  a  domain  that  no  longer  exists, 
but  it  is  not  currently  possible. 

The  EB  icon  launches  a  window  that  allows  you  to  search  a  dictionary,  a  thesaurus,  and  a 
concise  version  of  the  Encyclopedia  Britannica.  The  full  Encyclopedia  Britannica  is  also  an 
option  in  that  window,  but  you  may  use  the  full  Encyclopedia  Britannica  only  if  you  have  paid 
the  Encyclopedia  Britannica  subscription  fee. 

Alexa  has  the  most  astonishingly  responsive  technical  support  1  have  ever  seen.  They  have 
replied  to  my  questions  the  same  day  and  the  replies  have  not  been  form  letters.  I  wish  other 
software  and  hardware  companies  had  tech  support  like  that! 

A  friend  of  mine  told  me  he  installed  Alexa  but  he  doesn't  use  it.  He  said  he  didn't  like  the 
clutter  on  his  screen  and  he  thought  it  was  like  PointCast  in  the  sense  that  it  was  software  that 
initially  seemed  interesting  but  later  became  an  annoyance.  He  admitted  that  he  hadn't  read 
the  documentation  and  that  he  had  been  unaware  of  some  of  Alexa's  features. 

Send  questions  and  comments  to  scruffv@prairienet.orq.  If  I  made  any  mistakes  in  this  page, 
please  help  me  correct  them. 


scruffy  home 
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With  IE  4.0  in  wide  release, 
many  users  have  gotten  a 
ghmpse  at  Microsoft's  cliosen 
way  to  bring  the  Internet  to 
your  desktop.  What  the 
company  forgot,  in  my 
opinion,  is  some  of  the 
substance  serious  surfers  need 
to  complete  their  browsing 
experience.  Alexa  Internet  is  a 
free  sei"vice  that  fills  the  gap, 
helping  you  surf  smarter,  not 
harder. 
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Alexa  displays  a  toolbar  that 
works  with  your  browser, 
providing  a  continuous  source 
of  relevant  recommendations 
on  where  your  next  stop  on  the 
Web  should  be.  This  toolbar  is  displayed  over  your  browser's 
status  bar.  Information  in  the  Alexa  bar  is  site-  and 
context-sensitive,  changing  as  you  surf  from  site  to  site. 
Available  infomiation  for  a  given  site  includes  who  owns  the 
site,  how  many  pages  it  has,  how  frequently  the  site  is 
updated,  and  the  site's  popularity  among  other  Alexa  users. 
The  latter  infomiation  is  provided  by  giving  users  the  ability 
to  vote  on  whether  they  like  or  dislike  a  particular  site.  Alexa 
also  provides  relative,  targeted  recommendations  of  where  to 
go  next. 

Using  Alexa' s  network,  you  can  see  which  of  your  friends  are 
online  and  send  them  messages  in  real  time,  perfonn  research 
on  the  Web  through  your  complete  desktop  reference  tool,  and 
access  the  Encyclopaedia  Britannica  Online  and 
Merriam- Webster's  Dictionary  and  Thesaurus. 

Alexa  offers  some  very  valuable  information,  although  you 
may  be  overwhelmed  at  first  with  it's  abundant  set  of  features. 
Considering  the  improvements  that  have  been  made  to  this 
app  since  we  first  featured  a  review  of  the  beta  version,  you 
can  look  for  more  good  things  from  Alexa  in  the  future.  - 
Clinstopher  Spera  and  Kem/  R.  Kmeger 
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Small  Firms  Turn  to  Web 
To  Boost  Market  Presence 

By  RIVKA  TADJER 

Special  to  The  Wall  Street  Journal  Interactive  Edition 

What's  in  a  brand  name?  For  companies  seeking  to  turn  a  profit, 
everything. 

But  it's  also  expensive  to  transform  a  company  name  into  a  household 
word.  So  small  businesses  with  invariably  tight  advertising  and 
marketing  budgets  are  turning  to  the  Web  for  a  unique  opportunity  to 
leverage  bigger  companies'  market  presence  by  buying  ad  banner 
space  on  their  sites.  Call  brand-building  on  the  Web  the  business 
version  of  social  climbing. 


B 


Links 

k   iJoin  an  on-line 
*  Idiscussion  about 
using  the  Web  for 
brand-building. 

Wharton's 
Pennsylvania  Small 
Business 

Development  Centers 
www.libertynet.org/pasbdc/ 

Alexa  Internet 
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Cirque  Corp. 
www.cirque.com 

Kraft  Foods 
wvtfw.Kraftfoods.com 

Tripod 
www.tripod.com 

IVillage 
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"Striking  strategic  alliances  on  the 
Web  and  trying  to  leverage  the 
brands  of  bigger  entities  makes  a  lot 
of  sense;  that's  why  everyone  is 
trying  to  get  in  with  America  Online 
or  put  a  banner  ad  on  Yahoo!  or 
Barnes  &  Noble,"  says  Paul  Morin, 
director  of  the  Wharton  Small 
Business  Development  Center. 
"Remember  that  the  only  way  to  take 
a  small  business  out  of  obscurity  and 
into  profitability  is  by  building  a 
brand  name.  Today  that  means 
advertising  in  the  big  four  —  TV, 
radio,  print  and  on  Web 
communities." 

While  TV,  radio  and  print 
communicate  with  the  consumer  in 
only  one  direction,  the  Web  allows 
for  an  interactive  exchange.  The 
approach  companies  take  to  building 
a  brand  on  the  Web  should  take 
advantage  of  that  difference,  Mr. 
Morin  says.  "This  means  exploiting 
what  the  Web  medium  can 
accommodate  ~  use  audio,  video, 
interactive  e-mail,  whatever  it  takes 


for  people  to  buy  into  whatever  you're  selling  and  come  back  to  your 
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company. 

What  the  Web  does  best,  Mr.  Morin  says,  is  reach  a  target 
demographic  of  males  from  the  ages  of  15  to  30,  and,  to  a  lesser 
degree,  30-  to  40-year-olds.  "These  are  the  people  using  the  Net  all 
the  time,"  he  says.  "And  while  there  are  an  increasing  number  of 
women,  target  your  interactive  campaigns  to  get  as  many  of  this 
demographic  profile  involved  with  your  company  as  possible." 

Some  brand-building  veterans  say  the  brand  name  is  even  more 
important  than  the  product.  Small  businesses  cooking  up  Web 
strategies  may  do  well  to  take  a  page  from  this  Madison  Avenue 
school  of  thought. 

"Your  brand  name  defines  the  relationship  consumers  have  to  your 
company,  not  just  your  product.  If  that  brand  name  is  good  and  the 
relationship  a  loyal  one,  you  can  build  your  business  with  a  variety  of 
successful  products  and  services,"  says  Thomas  Quarton,  president  of 
Cirque  Corp.,  the  Sah  Lake  City-based  start-up  company  that 
invented  the  technology  for  Apple  Computer's  Powerbook  touch-pad 
pointing  device  (called  the  GlidePoint),  which  replaced  the  trackball. 
In  the  five  years  since  Cirque  was  founded,  it  has  taken  command  of 
almost  50%  of  the  touch-pad  marketplace. 

In  17  years  of  consumer  marketing,  Mr.  Quarton  has  built  brand 
names  for  Clearasil,  M&M  Mars,  Marriott  Corp.,  and  Vidal  Sassoon. 
But  getting  the  word  out  on  Cirque  is  the  first  time  Mr.  Quarton  has 
felt  how  much  of  a  difference  the  Internet  makes.  "People  buy  based 
on  the  relevance  your  company  has  to  their  lives  overall,  not  just 
because  of  one  product  or  another.  That's  what  brand  name  really  is," 
he  says.  "The  Internet  lets  us  directly  interact  with  consumers  ~ 
showing  demos  of  our  product,  amusing  people  with  games  on  our 
Web  site,  having  conversations  over  e-mail,  being  there  live  for  them 
in  as  many  ways  as  we  can  invent,"  he  says. 

"You  want  to  know  my  favorite  Web  site?  Kraft  Foods,"  says  Mr. 
Quarton.  "I  think  it  does  the  best  job  of  inviting  consumers  to  have  a 
relationship  with  the  company,  not  just  one  product  Kraft  sells.  My 
wife  and  I  constantly  go  into  their  recipe  section  and  find  great  stuff 
Kraft  has  pulled  off  the  tough  task  of  creating  its  own  interactive 
community." 

Since  most  smaller  businesses  don't  have  the  funding  to  create  their 
own  communities,  they  are  advertising  on  the  sites  of  existing 
communities  to  build  their  own  brand  names.  Banner  advertisements 
on  computer-community  Web  sites  such  as  ZDNet  are  central  to 
Cirque's  Web  campaign  because  they  target  Cirque's  audience  in  an 
efficient  way,  Mr.  Quarton  says.  "We  advertise  in  a  bunch  of  the 
computer  print  magazines  that  Ziff  publishes,  such  as  PC  Magazine 
and  Family  PC  ~  a  hon's  share  of  our  $1  million  print  ad  budget  is 
for  Ziff—  but  our  $15,000  banner  ad  on  ZDNet  reaches  readers  of  all 
those  magazines  at  once."  ZDNet's  fee  buys  Cirque  one  fiscal  quarter 
of  advertising.  Mr.  Quarton  also  places  a  banner  ad  ~  at  the  same 
price  ~  on  the  Yahoo!  search  engine  site  to  reach  less  tech- focused 
Nefizens. 

If  a  business  can  only  afford  to  do  one  brand-building  program,  Mr. 
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Quarton  says,  demonstrating  prowess  of  the  product  at  the  point  of 
sale  is  the  way  to  go.  "And  if  your  product  or  service  can  be 
demonstrated  and  sold  on  the  Internet,  it  may  be  the  least  expensive, 
most  efficient  way  you'll  find." 

Netizens,  Mr.  Quarton  says,  are  a  curious  and  willing  universe  of 
prospects,  and  businesses  can  find  out  more  on-line  about  what  those 
prospects  think  than  is  possible  fi"om  TV  or  an  ad  in  a  magazine.  A 
registration  form  Netizens  fill  out  for  a  contest  can  be  a  well  of 
information.  And  the  contest  is  good  public  relations  for  the  whole 
company  —  not  just  a  single  product. 

Bruce  Gilliat,  general  manager  of  a  San  Francisco  start-up  called 
Alexa  Internet,  says  brand-building  on  the  Web  offers  another 
advantage  to  folks  in  his  boat  ~  the  "25  employees  but  no  revenues 
yet"  situation:  "We  started  our  company  and  Web  site  in  July,  and 
even  though  we  didn't  have  money  for  banner  ads,  we  could  swap  ads 
with  other  companies  on  the  barter  system  to  create  a  virtual 
community,"  he  says 

Companies  such  as  Williamston,  Mass.-based  Web  developer  Tripod 
and  Ivillage:  The  Women's  Network  swapped  ads  with  Alexa. 

Alexa's  flagship  product  is  an  Internet  search  service.  Alexa  staffers 
have  downloaded  the  contents  of  the  public  Web  into  a  database  and 
analyzed  the  sites.  They  keep  updating  their  enormous  database  all 
the  time.  Owner  Brewster  Kahle  claims  that  the  free  Alexa  service 
can  find  any  site  on  the  Web,  even  if  other  search  engines  fail  at  the 
task  and  simply  pop  up  with  one  of  those  pesky  "Error  404:  URL  Not 
Found"  messages.  Alexa's  slogan  is  "Where  am  I  and  where  do  I  want 
to  go  next?" 

Mr.  Gilliat  says  all  of  Alexa's  brand-building  focuses  on  boosting  the 
company's  reputation,  not  just  the  Web-search  product  slogan.  With 
$1.3  million  in  seed  capital  from  investors,  the  few  paid  marketing 
moves  Alexa  has  made  so  far  include  spending  $100,000  on  a  launch 
party  in  the  Bay  Area,  holding  a  press  conference,  and  participating 
in  an  Internet  trade  show  in  Chicago. 

Mr.  Gilliat  says  the  company  also  spent  close  to  $20,000  for  a 
designer  to  make  sure  that  everything  ~  the  Web  site,  the  banner  ads, 
the  brochures,  everything  with  the  Alexa  name  on  it  —  has  precisely 
the  same  look.  "That  unified  look  is  key  to  brand-name  recognition," 
says  Mr.  Gilliat. 

Next  on  Alexa's  agenda  ~  when  the  company  starts  generating 
revenue  from  advertisers  ~  is  probably  TV  advertising  on  cable 
stations  such  as  CNNfn  and  CNBC.  "And  all  the  ads  will  have  our 
URL  and  slogan  at  the  end." 

Businesses  trying  to  tie  TV,  print  and  radio  commercials  back  to  the 
Internet  should  use  the  URL  as  part  of  the  brand  character  itself, 
Wharton's  Mr.  Morin  says.  "Generally,  when  you  go  for  branding,  go 
for  the  community  with  the  largest  target  audience  based  on  what  you 
can  afford,"  he  says. 

The  trick  with  Web-community  deals,  though,  is  avoiding  being 
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taken  advantage  of  by  the  big  brand.  "They  hold  the  cards,  because 
there  are  only  so  many  big  companies  on  the  Web,  and  they  have  a 
huge  universe  of  small  companies  to  choose  from,"  Mr.  Morin  says. 

But  it  is  possible  to  succeed  in  deals  with  large  companies.  Long 
Island,  N.Y. -based  1-800  Flowers  cut  a  deal  with  AOL  earlier  this 
year  that  was  expensive  and  negotiated  without  any  standard 
measure.  It  took  a  reported  $25  million  for  1-800-Flowers  to  talk 
AOL  into  making  the  florist  the  only  florist  in  AOL's  popular 
Shopping  Channel  for  the  next  four  years.  There  are  no  standardized 
prices  for  these  sorts  of  deals,  AOL  officials  say. 

Donna  lucolona,  director  of  interactive  services  division  at 
1-800-Flowers,  says  there's  a  better  way  to  gauge  value  than  standard 
pricing:  "If  you  can  make  back  the  investment  in  the  first  year,  the 
price  doesn't  matter,"  she  says.  If  the  only  florist  the  10  million  AOL 
account  holders  see  every  time  they  enter  the  Shopping  Channel  on 
AOL  is  1-800-Flowers,  Ms.  lucolona  says  her  brand  name  goals  have 
been  accomplished. 
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Interview:  Yahoo's  Tim  Koogle 


SIDEBARS 


December  01,  1997 
By  Richard  L.  Brandt 

Yahoo  just  may  be  the  most  visited  site  on  the 
Internet.  Every  day,  perhaps  a  miUion  people 
view  some  50  million  pages  on  Yahoo,  making  it 
one  of  the  few  profitable  Web  sites  on  the 
Net— from  ad  sales,  no  less!  CEO  Tim  Koogle 
tells  how  it's  done. 
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CNET's  Halsey 
Minor 
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Upside:  What  brings  so  many  people  to  your  site? 

Koogle:  A  number  of  things.  The  original  premise  was  that 
of  bringing  functionality  to  the  Web,  of  providing  a 
navigational  guide  to  a  broad  array  of  users.  We  wanted  to 
be  the  only  place  you  had  to  remember  to  come  back  to,  to 
find  access  to  all  kinds  of  content,  and  now  all  kinds  of 
goods  and  services  as  well.  We  aggregate  content  from 
different  sites.  Doing  that  created  a  platform.  That  platform 
ends  up  looking  like  a  gateway,  or  a  hub,  so  now  tens  of 
millions  of  people  come  through  that  hub  monthly. 

How  do  you  know  what  people  want?  We  use  the 

consumption  patterns  of  our  visitors  to  augment  and 
extend  the  platform  globally. 

But  you  use  human  brains,  rather  than  technology,  to 
choose  new  links?  Yeah.  It's  important  to  separate  the 
different  means  of  content  delivery  from  the  structure  of 
our  sites. 

Full-text  search  engines  are  an  automated  way  of  guiding 
people  toward  content.  People  enter  a  word  or  a  phrase  and 
hit  "search,"  and  a  search  engine  goes  and  searches  across 
a  compressed  index  of  text  documents,  which  have  been 
gathered  by  a  crawler  [an  automatic  software  engine  that 
gathers  information  about  sites  across  the  Web].  That's  one 
way  of  connecting  people  with  content. 

Another  way,  which  was  our  starting  point,  was  to  build  a 
directory,  a  hierarchy  where  we  aggregate  sites  around 
subject  themes.  We  build  a  table  of  contents,  if  you  will, 
and  allow  users  to  visually  browse  through  that  and  be 
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guided  by  it. 

We've  done  both.  The  way  we  figure  out  what  people  want 
in  deeper  aggregation  and  the  way  we  structure  our 
properties  is  part  automated  and  part  editorial.  We  have  a 
lot  of  tools  that  we've  built  in-house  to  take  all  the  massive 
usage  files  that  we  generate  daily  and  reduce  that  and 
extract  the  fundamental  key  metrics.  We  also  have  internal 
tools  that  allow  us  to  make  use  of  that  and  deliver  ads  on  a 
targeted  basis. 

But  when  it  comes  right  down  to  it,  people  who  have  the 
sensibilities  of  what  is  the  most  logical,  easy,  fun,  quick 
way  of  guiding  people  toward  content  make  those 
decisions. 

We  seem  to  be  so  fascinated  with  technology,  smart 
agents,  automatic  search  engines,  natural  language 
programs  and  other  artificial  intelligence  programs, 
but  we  forget  that  stuff  doesn't  work  well  yet.  Thafs 
right. 

What  do  you  think  of  Brewster  Kahle's  approach  [at 
Alexa  Internet]  of  trying  to  lead  people  to  new  sites  that 
may  be  of  interest  to  them  by  showing  them  where 
other  users  go  after  visiting  the  same  site?  That's  one  of 
a  number  of  different  efforts  that  various  folks  on  the  Web 
have  mounted,  trying  to  detect  user  patterns  dynamically 
and  use  that  to  either  automatically  or  non-automatically 
suggest  to  them  places  they  should  go.  This  whole 
business  of  collaborative  filtering  and  the  rest  has  been 
tried,  and  none  of  it  has  worked  super-well.  As  you  know, 
we've  had  a  relationship  with  [intelligent-agent  company] 
Firefly,  and  we've  experimented  with  collaborative 
filtering. 

The  question  is  whether  users  would  take  the  time  to  enter 
in  some  of  their  preferences,  to  kick-start  these  things. 
Then  whether,  after  using  them  for  a  while,  the  users 
perceive  enough  value  in  them.  Thirdly,  whether  users 
have  some  sense  that  they  may  be  missing  something 
because  something  else  is  making  the  choice  for  them. 
We've  gotten  input  from  almost  all  the  users  in  our  focus 
group  studies  that  there's  a  lingering  concern  that  it 
excludes  things  they  may  want. 

There's  still  a  technology  problem?  I  think  so.  It's  too 
early. 

Network  Boss? 

When  the  Web  first  arrived  and  first  became  a 
publishing  medium,  we  all  knew  it  would  not  be  like 
television,  newspapers  or  magazines— it  would  be  its 
own  form.  Is  Yahoo  becoming  that  new  medium? 

Yahoo  is  not  becoming  the  medium.  The  Internet  is  the 
medium.  I  have  always  thought  that  the  Internet  was  a 

2  of  7  12/7/97  10:46  PM 


Interview:  Yahoo's  Tim  Koogle  http://www. upside. com/texis/mvnv'.sU)ry?id347b590aO 

fundamental  shift  in  the  way  people  are  able  to  access 
content  and  communicate  with  one  another.  You  can  boil 
^^  that  all  down  into  a  communication  platform,  but  it's  one 

in  which  people  communicate  and  share  content  with  one 
another.  That  content  could  be  as  simple  as  an  e-mail  or 
more  detailed  author  content. 

What  Yahoo  has  been  from  day  one  is  a  media  company, 
in  the  context  of  the  Internet  as  a  medium. 

If  the  Internet  is  the  medium  like  television,  you're  like 
one  of  the  broadcast  stations.  Yeah,  those  analogies  have 
been  drawn  before—that  what  we  are  building  is  a 
successful  broadcast-media  company. 

Is  your  goal  to  be  the  primary  starting  place  for  people 
going  onto  the  Internet,  or  one  of  a  few  big  network 
companies  from  which  people  would  choose?  There's 
more  than  enough  room  for  several  entry  points  onto  the 
Web.  In  fact,  the  Web  is  less  controlled  than  any  broadcast 
medium  of  the  past,  where  people  could  buy  a  spectrum  in 
a  given  geographical  region  and  basically  control  the 
onramps  into  their  broadcast  medium.  There  are  hundreds 
of  thousands,  if  not  millions,  of  entry  points  onto  the  Web. 
We  have  always  intended  to  remain  a  hub  or  a  gateway. 

The  online  world  started  with  services  such  as 
CompuServe  and  America  Online,  which  offer  content 
on  their  proprietary  networks.  But  once  the  Web  took 
off,  these  companies  started  being  displaced  by  the 
Web.  Are  you,  rather  than  MSN  [Microsoft  Network), 
the  Web  replacement  for  AOL?  Some  people  are  saying 
that.  We  live  our  days  here  at  Yahoo  trying  to  figure  out 
how  to  satisfy  our  users'  needs,  to  deliver  as 
comprehensive  a  set  of  content  as  we  can  possibly  build 
for  as  wide  a  customer  base  as  possible  globally,  and  add 
communications  and  community  services.  The  fact  that 
those  same  things  apply  to  a  well-done  proprietary  online 
service  leads  people  to  try  to  draw  direct  analogies 
between  us. 

America  Online  has  been  creating  more  of  its  own 
in-house  content.  Is  that  also  an  appropriate  direction 
to  take  for  a  company  such  as  Yahoo?  We  don't  intend 
to  do  that.  There's  plenty  of  good-quality  content  on  the 
Web. 

Can  proprietary  online  systems  survive?  Yes.  There  is  a 
segment  of  the  world's  population  whose  needs  are 
well-served  by  a  proprietary  service.  There  is  a  level  of 
safety  and  ease  and  stability.  Whether  it's  large  enough  to 
build  a  big  business  around,  with  the  right  economics, 
remains  to  be  seen.  It  requires  getting  to  a  higher  level  of 
scale  because  you  carry  a  higher  cost  base. 

No  Spam! 
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What's  your  position  on  exclusive  partnerships  with 
content  creators?  In  general,  we  don't  believe  in  exclusive 
relationships.  People  have  asked  us  why  we  don't  require 
exclusives  from  content  vendors.  In  general, 
single-customer  businesses  don't  work.  For  us  to  require 
exclusivity  by  any  of  our  vendors  means  we  would  heavily 
limit  their  ability  to  establish  and  maintain  a  viable 
business.  I  won't  do  that. 

On  the  inward-bound  side  of  things,  sometimes  content 
suppliers  or  commerce  partners  request  exclusives  with  us. 
It's  tricky  because  we  have  a  clear  position  that  we  don't 
limit  the  availability  of  content. 

We  do  limited-term  things,  things  we  call  "runway 
projects,"  where  we  carve  off  an  initial  period  during 
which  we  don't  work  with  many  others  [to  launch  the 
project].  But  then  that  expires. 

GTE  recently  sued  Yahoo  and  five  Baby  Bells,  claiming 
you  offer  exclusive  access  to  their  Yellow  Page  services, 
which  cuts  out  GTE's  own  competing  Yellow  Pages. 
Isn't  that  an  exclusive  relationship?  What's  going  on 
there?  I'll  say  two  things  about  that.  First,  GTE  was 
offered  the  ability  to  be  there  on  the  Net  and  turned  it 
down.  So  their  position  puzzles  us.  The  second  is  that 
users  are  not  at  all  excluded  from  getting  to  them.  You  can 
get  to  GTE's  Yellow  Pages  from  a  number  of  points  in 
Yahoo  and  on  Netscape  [NetSearch].  Go  figure. 

You  also  have  relationships  with  Compaq  and 
Gateway,  so  that  a  Yahoo  icon  appears  on  the  desktop 
within  Microsoft's  Internet  Explorer.  Do  you  have  any 
kind  of  exclusivity  in  that?  Can  they  put  a  search 
engine  or  another  navigation  site  on  their  desktops  as 
well?  There  are  levels  of  exclusivity.  There  are  different 
parts  to  our  relationships  with  them.  We  are  the  exclusive 
featured  directory  provider,  but  they  can  have  other 
navigation  companies  cofeatured. 

So  while  you  don't  go  after  exclusivity,  you  don't  mind 
trying  to  negotiate  yourself  a  premier  status.  You  bet. 

It  seems  as  though  there  is  also  some  value  to  users  to 
have  you  make  an  editorial  judgment  about  which 
service  or  content  is  better.  To  some  extent  you  have  to 
do  that,  don't  you?  Yeah.  We're  careful  about  those  sorts 
of  things,  though,  because  our  users  vote  and  show  us  what 
they  find  most  compelling. 

How  do  you  strike  a  balance  between  users'  choices  and 
your  choices?  When  you  go  to  www.yahoo.com  and  go 
through  the  subject  branches  of  the  tree,  you  get  to  one 
with  a  listing  of  sites.  You'll  always  find  that  those  are 
alphabetically  arranged. We  want  to  stay  unabridged  and 
unbiased.  You  also  see  the  "cool"  sunglasses  by  some 
sites,  indicating  the  picks  of  the  day.  These  are  editorial 
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judgments. 

\  You  have  done  a  great  job  creating  the  Yahoo  brand. 

Some  people  say  your  brand  mainly  appeals  to  people 
first  venturing  onto  the  Internet.  Do  you  see  yourself  as 
a  starting  place  for  beginners?  We're  more  broad-based, 
and  the  data  shows  it.  About  60  percent,  and  sometimes 
more,  of  our  traffic  comes  from  people  accessing  Yahoo 
from  the  workplace.  We  also  have  survey  data  done  by 
third  parties  that  tells  us  that  well  over  a  third,  probably 
about  a  half,  of  our  users  are  experienced  users  who  have 
been  on  the  Web  a  long  time. 

You've  recently  added  things  such  as  free  e-mail.  Given 
that  that's  not  likely  to  be  a  great  revenue  prospect, 
what  do  you  gain  from  offering  that  to  users?  It's  a  key 
component  to  this  thing  called  communication  and 
community.  We're  not  only  a  place  where  people  come  to 
get  information,  we're  a  place  where  people  can  find  other 
people.  It  gives  our  users  yet  another  level  of  quality 
experience. 

Another  thing  it  does  for  us  is  create  "stickiness"  to 
Yahoo's  properties.  It  gives  us  yet  another  form  of  direct 
one-to-one  communication  with  our  user  base  because  they 
register  for  the  service.  People  also  register  for  My  Yahoo 
and  Yahoo  Chat.  It  allows  them  to  tell  us  what  they  want. 
-^  I'm  also  fundamentally  convinced  that  it  turns  into 

J  revenue. 

Because  it  keeps  people  on  your  site,  viewing  your  ads 
more  frequently?  Definitely. 

What's  your  philosophy  on  creating  stickiness?  There's 
still  a  common  mistake  among  Web  sites,  trying  to  make  it 
so  you  never  leave  that  site,  which  is  the  antithesis  of  the 
whole  spirit  of  the  Internet.  There's  a  balance  in  there. 
What  we  don't  do  here  is  try  to  figure  out  ways  of  tricking 
people  to  stay  on  Yahoo.  We  have,  probably  at  the  top  of 
our  values,  a  commitment  to  make  it  easy  for  users  to  find 
and  get  cormected  to  content,  no  matter  where  it  is  on  the 
Web. 

However,  we  want  to  continue  to  extend  our  value  to  users. 
As  we  do  that,  we  find  that  users  come  back  and  stay 
longer.  It's  a  subtle  thing,  but  it's  profound. 

Are  people  reluctant  to  register?  What  do  you  do  with 

the  information?  We  never  give  it  to  anyone  else.  We've 

been  asked  to  sell  it  many  times,  but  it  stays  inside  Yahoo, 

and  we  use  it  only  to  the  extent  that  it  helps  us  to  guide  the 

delivery  of  content  to  them.  Ultimately,  we  may  begin  to 

target  ads  to  them  as  they  consume  inventory.  With  the 
\  recent  addition  of  free  e-mail  on  Yahoo,  we've  made  a  pact 

J  with  our  users  to  not  spam  them. 

Future  Technologies 
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You  have  a  relationship  with  Visa.  Do  you  see  a  future 
as  an  electronic-commerce  site?  Yeah,  with  one  subtle 
distinction.  Just  as  we  don't  intend  to  get  into  the 
original-content  business  but  partner  with  original-content 
creators,  we  also  don't  intend  to  get  into  the  business  of 
literally  selling  things  on  Yahoo.  We  do  intend  to  be  a 
good  aggregation  source  for  various  vendors  of  goods  and 
services.  We've  got  a  short  list  of  a  dozen  to  15,  going  on 
20,  commerce  partnerships.  E-Trade.  [Ameri trade],  DLJ 
Direct.  Some  of  them  we  haven't  announced. 

So  you'll  point  people  to  these  sites  and  work  out  some 
revenue-sharing  deals,  take  transaction  fees?  Both.  We 
end  up  being  a  piece  of  virtual  shelf  space,  if  you  will. 
We'll  get  a  fixed  fee  for  the  distribution  space,  then  a  cut  of 
the  transactions. 

What  about  MasterCard  and  American  Express?  We 

have  agreed  with  Visa  to  have  them  as  the  premier  card, 
but  the  others  are  not  excluded. 

You  are  also  doing  audio  broadcasts.  May  I  presume  at 
some  point  video  broadcasts  as  well?  If  streamed  video 
emerges,  then  we  will  take  the  same  position  of 
aggregating  directly  to  the  source.  We'll  offer  people  a 
convenient  place  to  come  and  find  [video]  content  and  then 
deliver  it  to  them. 

What  do  you  think  about  push  technology?  As  you 

know,  there's  a  range  of  technologies  and  forms  for  push. 

Some  people  will  have  some  of  their  content  delivered 
some  of  the  time  via  push.  I've  always  believed  that  you'll 
see  a  spectrum.  There  are  people  on  one  end  of  the 
spectrum  whose  content  needs  are  narrow  and  don't  change 
often.  On  the  other  end  are  people  whose  content  needs  are 
broad  and  change  all  the  time. 

The  people  on  the  narrow  end  of  the  spectrum  can  more 
easily  designate  stuff  to  be  delivered  to  them  regularly  via 
push.  The  folks  on  the  other  end  will  have  a  hard  time 
designating  their  needs. 

We  intend  to  embrace  as  broad  a  user  base  as  possible,  so 
push  is  a  viable  component.  But  it  is  not  the  be-all  and 
end-all. 

You've  also  done  some  deals  with  WebTV  and 
Microsoft.  What  do  you  think  about  the  merger  of  the 
television  and  the  Internet?  There's  something  to  the 
broad  thesis.  There  will  be  a  convergence  between  what 
we  call  a  separate  computing  device  and  a  television. 
Televisions  will  get  smarter  and  bandwidth  will  increase, 
but  it  will  take  time.  Several  generations  of  consumers  now 
have  been  conditioned  to  view  and  use  the  TV  as  a 
one-way  entertainment  device. 
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So  it  requires  not  only  a  huge  change  in  technology,  but 
a  behavioral  change  for  people.  Yes. 

Do  you  think  that  will  happen?  It'll  happen  over  lime.  I 
can  easily  see  five,  10  years  from  now  that  we'll  have  in 
our  homes  and  our  workplaces  devices  that  have  multiple 
uses.  Cost  will  continue  to  plummet  until  it  enables  the 
convergence  of  multiple  functionalities  for  similar  or  lower 
price  points  than  we  see  now  in  PCs.  That  makes  sense. 
Less  clutter  in  your  life. 

Why  not  just  two  separate  devices,  one  for  push  and 
one  interactive?  Possibly.  The  way  we  view  it,  it's  all 
good.  The  more  content  that's  available  on  more  devices, 
the  more  consumers  there  are  accessing  it,  and  the  more 
there  is  for  the  role  of  Yahoo  in  guiding  them  to  it. 

Everybody's  Favorite  Nemesis 

What  do  you  think  about  sites  such  as  MSN,  which  act 
as  aggregator  and  original-content  provider?  Everyone 
who  hangs  out  with  the  folks  in  Redmond  knows  that 
they're  evolving  their  strategy  and  their  vision.  They 
started  off  with  a  strict  approach  of  being  a  proprietary 
online  service  and  charging  a  subscription  fee  and  offering 
access.  But  they're  searching  for  ways  of  evolving  out  of 
that. 

Will  Microsoft  ever  be  a  significant  competitor  to 
Yahoo?  Microsoft  remains  a  company  that  we  admire  a  lot 
and  fear  like  everybody  else!  We  have  a  relationship  with 
them,  and  we  continue  to  look  for  great  ways  of 
augmenting  that,  and  they  are  in  the  process  of  continuing 
to  evolve  their  vision.  We're  mindful  that  they  could  come 
into  our  space. 

What  worries  you  most?  One  thing  that  is  a  challenge  for 
anybody  in  our  space  is  managing  our  growth.  It  turns  out 
to  be  something  we've  done  fine  on  to  date,  but  it  remains 
a  concern,  and  it  probably  always  should  be  until  the  day  I 
set  foot  in  my  grave! 

Richard  L.  Brandt  is  editor  of  UPSIDE. 
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[December  1 1  ]  Free  Web  navigation  service  Alexa  said  that  its 
advertisers  now  have  more  opportunities  to  reach  target  markets 
by  displaying  ads  directly  on  the  Alexa  toolbar. 

The  service,  which  gives  suggested  links  about  where  to  go  next 
on  the  Web  and  detailed  information  about  every  Web  site,  calls 
the  program  its  "Ad  on  the  Bar"  campaign. 

The  free  Alexa  service  is  displayed  as  a  toolbar  on  the  user's 
desktop  and  is  available  for  users  of  Windows  95  and  NT.  Alexa 
can  be  downloaded  from  the  company's  Web  site. 

Designed  to  make  Web  advertising  easier  and  more  focused 
because  Alexa  accompanies  users  while  they  surf,  Alexa's  toolbar 
displays  advertising  images  on  a  continual  basis  as  they  surf  the 
Web.  Advertisers  can  target  users  by  interest  area,  industry, 
demographics,  and  by  advertising  on  their  competitors'  home 
pages. 

The  company  said  that  "only  Alexa  enables  advertisers  to 
advertise  directly  on  competitor  sites.  For  instance,  Barnes  & 
Noble  can  display  its  banner  ad  every  time  the  user  visits  the 
Amazon.com  site,  as  well  as  other  competitor  or  high-  traffic 
sites." 

Initial  advertisers  include  CMP,  Who  Where,  CNET  and  Web 
Turbo.  Rate  card  data  was  not  disclosed. 
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Where  am  I?  How'd  I  get  here?  Where  should  I  go  next?  If  you've  ever 
asked  yourself  these  questions  while  surfing  the  Web,  now  you  can  find 
your  path  with  Alexa  Internet.  The  free  Web-navigation  service  works 
with  Web  browsers  to  provide  a  toolbar  at  the  bottom  of  your  screen  for 
a  continuous  source  of  relevant  recommendations  of  where  to  go  on  the 
Web.  The  recommendations  are  derived  by  analyzing  patterns  and 
anonymous  usage  paths.  You  can  find  the  service  at 
http://www.alexa.com. 

Copyright  (c)  1997  CMP  Media  Inc. 
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Site  Name  :  Telecom  News:  Alexa  Internet  Service  http://www.fTOSt.com/ns-search/ver.. .e/aaaa001psbbec7b&NS-doc-offset=0& 

integration  of  Encyclopedia  Britannica's  reference  materials  coincided  with  the  launch  of  Alexa  1.0,  the 
company's  first  release  of  the  service  since  the  beta  program  was  launched  in  July  1997. 

Owing  to  the  growing  popularity  of  the  Internet,  this  type  of  service  will  undoubtedly  provide  valuable 
information  to  both  users  and  advertisers  on  the  World  Wide  Web.  It  will  provide  helpftil  hints  to  users 
as  to  the  most  effective  means  for  navigating  the  Web  and  inform  Web  site  owners  of  the  popularity  of 
their  sites. 

Request  further  information  on  the  following  related  reports:  Frost  &  Sullivan's  Report  fi^  2880-63.  U.S. 
Internet  Service  Markets,  and  Report  i^  2613-63.  Consumer  Internet  Fee-Based  Services. 

IRG  #  63;  A/T  Code  0811 
©  1997  Frost  &  Sullivan.  All  Rights  Reserved. 
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Alexa 

Alexa  has  come  of  age,  and  we  rate  it  as  one  of  the  most  valuable  free 
resources  for  WWW  surfers.  Alexa  uses  an  humungeous  archive  of 
WWW  sites.  It  provides  powerful  search  capabilities,  technical  and 
ownership  information  on  sites,  and  as  a  bonus  allows  immediate  free 
access  to  the  entire  Encyclopaedia  Britannica  and  Merriam-Webster's 
Dictionary  &  Thesaurus 
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Anawave  Websnake 

Anawave  WebSnake  1.23  will  grab,  search,  and  retrieve  files  from  the 
Internet.  WebSnake  allows  users  to  filter  unwanted  data  and  retheve  the 
information  they  want.  WebSnake  supports  off-line  browsing,  advanced 
searching  and  retrieval  of  files  by  keywords  or  file  type,  website  mirroring, 
creating  graphical  site  maps,  and  retneval  of  e-mail  addresses.  WebSnake 
also  supports  the  ftp  This  is  powerful  software,  almost  essential  for 
anyone  who  who  does  lots  of  surf  research.  Windows95  &  NT. 


AutoWmNet95 

Automated  Internet  for  Windows95/NT  Allows  scheduling  common  tasks 
for  unattended  operation.  Upgrades  automatically,  and  supports  35  steps, 
including  FTP:  retries  busy  sites,  wildcard  download  and  upload, 
make/remove  remote  files  and  directories,  plus  customized  logins.  Sends 
email  with  an  advanced,  feature  packed  editor,  unlimited  multiple 
mailboxes,  checks  mail,  or  cleans  up  your  mailbox.  WWW:  Grab 
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Op  zoek  naar  een  slimme 

zoekmachine?  Probeer  Alexa.  Alexa 

zoekt  gericht,  presenteert 

concurrerende  websites  van  jouw 

favorieten,  vertelt  je  of  een  website 

snel  en  populair  is  en  weet  ook  nog 

heel  oude  webpagina's  boven  water  te 

halen.  En  dat  is  nog  lang  met  alles 


Het  bestbewaarde  geheim  van  het  Internet? 


L  I  B  B  e  nic  i 


lak  bij  de  beroemde  Golden  Gate 
Bridge  m  San  Francisco  ligt  het  Prese- 
dio,  een  door  hoge  sparren  omgeven 
legerkamp  dat  ongeveer  honderd  jaar 
geleden  werd  opgericht   Militairen 
wonen  er  met  meer,  het  grootste  dee!  van  het 
kamp  IS  museum  geworden,  Hoewel  we  er 
voor  het  eerst  zijn,  komt  het  kamp  ons 
bekend  voor  Naar  onze  stellige  overtuiging 
moet  hier  een  van  Columbo's  beste  afleverin- 
gen  zijn  gefilmd  In  een  van  de  witte  barak- 
ken  van  het  kamp  is  Alexa  Internet  gevestigd, 
het  best  bewaarde  geheim  van  het  Presedio 
en  van  het  World  Wide  Web. 

Alexa  werd  in  1996  opgericht  door  Brew- 
ster Kahle  en  Bruce  Gilliat,  Bekende  namen 
voor  Internet  die-hards,  want  beiden  waren  in 
het  verleden  betrokken  bij  WAIS,  een  zoeksys- 
teem  voor  gespecialiseerde  tekstbestanden 
op  het  Internet  WAIS,  voortgekomen  uit  het 
bedrijf  Thinking  Machines,  was  er  al  voor 
(inmiddels  archaische)  navigatiehulpmiddelen 
als  Gopher,  en  dus  ook  ver  voordat  het  World 
Wide  Web  zijn  intrede  deed   In  de  meeste 
Internet-handboeken  wordt  nauwelijks  nog 
aan  WAIS  gerefereerd,  want  de  meeste  WAIS- 
servers  zijn  inmiddels  'normale'  zoeksystemen 
geworden.  Het  gelijknamige  bedrijf  werd  m 
1995  voor  vijftien  miljoen  dollar  aan  online 
gigant  America  Online  verkocht 


Alleennuttigeinfoa.u.b. 

Met  Alexa  borduren  Brewster  Kahle  en  Bruce 
Gilliat  voort  op  de  WAIS-gedachte.  Namelijk 
hoe  vind  je  in  het  doolhof  van  het  Internet 
relevante  informatie  over  onderwerpen  die  le 
interesseren?  Reguliere  zoeksystemen  overla- 
den je  met  duizenden  suggesties,  en  wegwij- 
zers  als  Yahoo  doen  dat  ook,  "Theoretisch 
kun  je  natuurlijk  het  complete  web  indexe- 
ren",  zegt  Brewster  Kahle  in  zijn  kantoor 
"Maar  wat  moet  je  met  al  die  informatie'  Het 
web  verdubbelt  zich  iedere  zes  maanden,  voor 
mensen  is  het  vrijwel  onbegonnen  werk  om 
orde  in  de  chaos  te  brengen,"  Een  wegwijzer 
als  Yahoo  of  Excite  verwijst  misschien  naar 
slechts  een  procent  van  het  web,  aldus  Kahle 
"Zelfs  als  je  met  veel  freelancers  werkt,  zoals 
The  Mining  Company  (zie  kader),  krijg  je  hoog- 
uit  honderdduizend  pagina's  boven  tafel  " 
Alexa  gebruikt  drie  geautomatiseerde 
technieken  om  interessante  webpagina's  op 
te  sporen.  Met  behulp  van  een  gratis  pro- 
gramma  kunnen  webgebruikers  aangeven  of 
zij  bepaalde  sites  nuttig  vmden  of  met,  Ook 
volgt  Alexa  de  bewegingen  van  gebruikers  op 
het  World  Wide  Web.  Daarnaast  wordt  reke- 
ning  gehouden  met  zogenaamde  verwijspagi- 
na's  van  experts,  Kahle:  "Op  het  web  vind  je 
altijd  wel  een  paar  goeroes  die  weten  waar 
Abraham  de  mosterd  haalt  " 


In  feite  maakt  Alexa  gebruik  van  wat  Kah- 
le noemt  'het  collectieve  geheugen  van  de 
webgemeenschap'.  "Ik  huur  graag  speelfilms 
waarin  bepaalde  acteurs  spelen.  Die  namen 
kan  ik  natuurlijk  in  een  f  ilmencyclopedie 
opzoeken,  maar  ik  heb  veel  meer  aan  de  aan- 
bevelingen  van  mijn  filmvrienden   Alexa 
werkt  net  zo." 

Informatie  over  informatie 

Tot  zover  de  theorie,  nu  de  praktijk.  Om  te 
begmnen  moet  je  bij  Alexa  (www.alexa.com) 
een  gratis  programma  ophalen,  circa  1,7 
megabyte  groot,  dat  onder  Windows  95  en  m 
de  toekomst  ook  onder  MacOS  en  Windows 
3.x  als  een  taak  balk  op  J  e  scherm  verse  hunt 
Met  dit  programma  kunje  via  zogenaamde 
popup-menu's  extra  informatie  over  webpagi- 
na's opvragen.  Alexa  noemt  dat  meta-mfor- 
matie,  ofwel  'informatie  over  informatie'.  Zo 
kun  je  bijvoorbeeld  achterhalen  wie  de  eige- 
naar  of  exploitant  is  van  een  site.  Gaat  het  om 
een  beursgenoteerd  bedrijf,  dan  wordt  met 
alleen  het  adres,  maar  ook  de  omzet  vermieii 
Een  deel  van  die  informatie  komt  van  The 
Securities  and  Exchange  Commission  (SEC),  de 
waakhond  van  de  Amerikaanse  beurs,  en 
InterNIC,  dat  domeinnamen  registreert  en 
opslaat  in  een  database  Maar  er  is  meer 
Alexa  vertelt  je  ook  of  de  webserver  door- 
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gaans  snel  of  juist  erg  traag  is,  uit  hoeveel 
pagina's  de  complete  site  bestaat  en  hoe  vajk 
die  pagina's  worden  geactualiseerd  Verder 
staat  er  een  waarderingsci|fer  bi|    'Stel  dat  le 
via  het  Internet  bij  een  reisbureau  een  vlucht 
naar  Bali  wilt  boeken",  zegt  Kahle    "Dan  wil 
je  wel  wat  meer  informatie  over  dat  bedn|f 
hebben.  Met  Alexa  weet  je  in  elk  geval  of  de 
site  wordt  gewaardeerd," 

Alexa  wil  de  dienstverlenmg  overigens  uit- 
breiden.  2o  is  er  een  samenvverkmg  aange- 
kondigd  met  TRUSTe.  een  orgamsatie  die  er 
op  toeziet  dat  bij  elektromsche  transacties  de 
privacy  wordt  gewaarborgd.  Ook  v.erkt  Aiexa 
samen  met  The  Recreational  Software  Advi- 
sory Council,  die  websites  voorziet  van 
'ratings'  voor  taalgebruik,  sex  en  geweld 
Daarnaast  is  er  een  koppeling  mogehjk  met 
de  Encyclopaedia  Britannica,  een  thesaurus  en 
een  woordenboek  (Merriam-Webster  Online's 
WWWebster  Dictionary).  Als  je  een  woord 
tegenkomt  dat  je  met  kent,  kun  )e  snel  het 
synoniem  opzoeken.  Via  popup-menu's  wordt 
ook  naar  andere  relevante  sites  verwezen  Als 
je  bij  een  pagina  van  een  reisbureau  bent 
beland,  zai  Alexa  zi|n  concurrenten  noemen 

404  Not  Found  bestaat  niet  meer 

Uniek  is  de  mogelijkheid  om  van  ledere  web- 
pagina  de  vorige  versie  van  die  pagina  op  te 
vragen,  bijvoorbeeld  als  je  een  '404  Not 
Found'-mededeling  (pagina  met  gevorden)  ^e 
zien  krijgt.  Alexa  archiveert  smds  1996  het 
complete  World  Wide  Web.  Het  beheer  daar- 
van  is  uitbesteed  aan  de  non  profit  orgamsa- 
tie Internet  Archive  (www  archive  org)  Op  dit 
moment  bestaat  het  archief  uit  vi|fhonderd 
tapedrives,  elk  met  een  capaciteit  van  onge- 
veer  twee  terabyte.  Om  het  complete  .veb  te 
archiveren  is  vermoedeli)k  ongeveer  tien  tera- 
byte nodig.  Ter  vergeli)king   's  werelds  groc"- 
ste  bibliotheek,  het  Library  of  Congres,  heeft 
een  omvang  van  twintig  terabyte 

Het  Internet  Archief  is  voigens  Kahle  bitte- 
re  noodzaak.  Kranten  en  tijdschriften  worde-- 
bewaard,  maar  webpagma's  niet  Onderzoek 
van  de  Universiteit  van  Colorado  heeft  uitge- 
wezen  dat  de  levensduur  van  een  webpagma 
hooguit  44  dagen  bedraagt  'Wie  wil  /;eten 
wat  er  op  5  november  1996  zoai  oo  net  inter- 
net gebeurde,  kan  nu  m  elk  geval  di|  net 
Internet  Archief  terecht.  Samen  met  net 
Smithsonian  Institute  werkt  net  internet 
Archive  aan  een  tentoonstellmg  over  de 
invloed  die  het  web  had  op  de  president.ele 
verkiezingen  van  1996.  Op  dit  moment  kan 


Opzoeknaarkwaliteit 


Hoe  vind  \e  kwaliteit  op  het  web?  Hoe  scheid  je  het  kal  van  het  koren?  Menige 
webgebniiker  zaI  het  onmiddelUjk  beamen:  gemakkelijk  is  het  niet.  Oe  meeste 
webgehruikers  beginnen  bij  elektronische  wegwijzers  als  Yahoo.  LookSmart  en  Excite. 
Deze  zoeksystemen  worden  samengesteld  door  redacteuren  die  iedere  dag  het  web 
afstruinen  op  zoek  naar  relevante  informatie.  Maar  deze  redacteuren  zijn  omnivoren.  en 
geen  specialisten. 

ftrgUS  CleSringhOUSe    (www.clearinghouse.net).  een  initiatief  van  de  Universiteit  van 
Michigan,  verwijst  wel  naar  allerlei  specialisten  op  het  web. 

Het  verst  daarin  gaat  The  Mining  Compdny   (www.miningco.com).  Dit  Amerikaanse 
bednjf  huurt  freelancers  in  die  lijsten  met  relevante  websites  op  bun  vakgebied 
samenstellen.  Zo  wordt  de  pagina  over  kindergeneeskunde  onderhouden  door  een  echte 
kinderarts.  The  Mining  Company  kent  als  hoofdcategorieen  onder  meer  kunst  en 
amusement,  zaken.  carriere  en  onderwijs.  gezondheid.  sport  en  reizen.  ledere  gids  moet 
een  korte  opieiding  volgen  waarin  hij  of  zij  vertrouwd  wordt  gemaakt  met 
webtecbnieken  als  HTML  en  Java.  De  eerste  maanden  wordt  hun  werk  doorlopend 
geevalueerd.  Als  iemand  toch  niet  de  gewenste  kwaliteit  levert,  wordt  bij  uit  zijn 
functie  gezet.  De  freelancers  krijgen  veertig  procent  van  de  bruto  advertentie- 
inkomsten  als  honorarium  uitbetaald.  Inmiddels  telt  The  Mining  Company  ongeveer 
duizend  gidsen.  Binnen  vijf  jaar  moet  bij  The  Mining  Company  over  ongeveer 
vierduizend  onderwerpen  informatie  kunnen  worden  opgevraagd. 
tuiSOn  (www.8dison.com)  probeert  ook  orde  in  de  chaos  te  scheppen  via  een 
overzichtelijke  webpagina  met  honderden  verwijzingen  en  een  index  van  zoekmotoren. 
Als  je  wilt  kun  je  de  complete  site  downloaden  en  offline  bekijken. 
CompuServe   (www.compuserve.nl)  is  eind  vorig  jaar  Communities  gestart.  Via  zeven 
hoofdrubrieken  belandt  de  gebruiker  in  submenu's  met  een  verdere  onderverdeling.  Ook 
toegankelijk  voor  niet-leden.  Zie  ook  het  artikel  op  pagina  60. 
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het  archief  overigens  alleen  met  behulp  van 
het  Alexa-programma  worden  ontsloten,  in 
de  toekomst  wellicht  ook  via  zoeksystemen, 
Commerciele  exploitatie  is  vanwege  het 
auteursrecht  vrijwel  uitgesioten.  En:  "Als 
mensen  echt  niet  willen  dat  we  hun  pagina's 
bewaren,  halen  we  ze  onmiddellijk  uit  het 
archief",  zegt  Kahle.  "De  meeste  mensen 
komen  echter  al  gauw  op  hun  besluit  terug." 
Zelf  verdient  Alexa  hoofdzakelijk  aan  adver- 
tenties  die  m  de  taakbalk  worden  weergege- 
ven.  Kahle:  "We  kijken  ook  naar  andere  distri- 
butiemogelijkheden  van  onze  technologie.  De 
bedoeling  is  dat  ledereen  straks  weet  wat 
Alexa  Internet  is." 

▼  lURls 

www.alexa.com 

www.archive.Ofg 

www.clear1119l10use.nel 

www.miningco.com 

www.edison.com 

www.compusecwe.nl 
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Guru, 

I've  been  looking  for  a  browser  that  can  analyze 
Web  sites  when  I  surf  to  them  —  you  know,  like 
who  owns  them  and  how  much  traffic  they  get. 

While  the  new  browsers  have  more  bells  than  the 
cathedral  at  Notre  Dame  and  more  whistles  than 
NFL  referees,  they  still  don't  have  what  you're 
looking  for  just  yet.  But  a  piece  of  Web  navigation 
software  called  Alexa  does. 

Once  you  download  and  install  the  free  (but 
advertising-supported)  software,  Alexa  shows  up  as 
a  small  toolbar  that  hovers  over  your  browser.  The 
"Where  You  Are"  section  of  the  toolbar  provides  a 
mother  lode  of  info  on  any  site  you  connect  to, 
including  who  owns  the  site,  even  providing  a 
phone  number  and  street  address  for  many  (this 
feature  is  limited  to  sites  with  .com,  .org,  .edu,  and 
.net  suffixes  in  their  names,  however).  Alexa  will 
reveal  a  site's  loading  speed,  total  number  of  pages, 
and  when  it's  been  updated.  It  also  gives  you  an 
idea  of  how  much  traffic  it  gets  ~  if  you're  planning 
on  forging  a  mutual  link  with  someone  else's  site, 
this  is  the  perfect  tool  to  help  you  determine  if  it's 
worth  your  while. 

The  "Where  To  Go  Next"  section  of  the  toolbar 
suggests  links  to  other  sites  that  it  thinks  are  related 
to  the  page  you  are  visiting.  These  links  haven't 
been  suggested  willy-nilly  ~  they  come  from  Alexa 
watching  how  people  move  between  sites,  then 
analyzing  these  movements  to  determine  which 
sites  are  the  most  helpful  and  logical  places  to  surf 
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next. 

But  for  me,  the  best  part  of  Alexa  is  its  ability  to 
raise  the  dead.  Dead  sites,  that  is. 

Did  you  know  that  the  average  hfe  span  of  a  Web 
page  is,  according  to  Alexa,  44  days?  That's  shorter 
than  an  NBC  midseason  replacement  sitcom.  Try 
accessing  a  site  that's  gone  belly-up,  and  you  get 
that  annoying  "404  Not  Found"  message.  End  of 
story,  right?  Not  anymore.  Alexa  archives  a  vast 
amount  of  the  Web  on  its  central  servers  ~  when 
you  run  across  a  dead  site,  you  can  connect  with  its 
service  through  the  toolbar  and  ask  it  to  grab  its 
copy  of  that  site.  Currently,  Alexa  stores  about  8  to 
1 0  terabytes  of  such  information.  For  perspective, 
bear  in  mind  that  the  data  in  the  Library  of 
Congress  would  equal  about  20  terabytes  in  ASCII 
form. 

Alexa  works  with  Netscape  Navigator  2.0  or  higher, 
Netscape  Communicator  4.O.,  and  Microsoft 
Internet  Explorer  3.0  or  higher.  Currently  it  only 
runs  on  Win95  or  Windows  NT  4.0,  but  a  Mac 
version  is  coming  soon. 
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Your  e-mail  address  (so  we  can  contact  you,  we  won't  post  it): 

I 
Your  name  (so  we  can  say  who  the  question's  from): 


Your  question  to  the  Surf  Guru: 
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You  arc  now  deep  in  hypnosis.com  -  WWWcb  Server  Home  Page 


http;//www. hypnosis.com/cnotes. him! 


Our  site  went  down  in  December  (understatement,  the  server  is  still  being 
repaired).  To  make  matters  worse  our  back  up  failed  to  recover  anything. 

Up  until  Friday  Jan  1 8th,  we  were  running  on  a  two  year  old  back  up.    Then 
a  customer  e-mailed  us  and  said  we  could  fmd  our  site  at  www.alexa.com 

well,  needless  to  say  we  are  back  up  because  of  them.  If  you  have  a  web  site 
make  sure  you  have  alexa  back  you  up. 


1  ofl 


1/29/98  3:02  PM 


Web  Hosting  Service  -  www.eurosat.com 


http :  //www .  eurosat.  com/eurosat/alexa-rating  .html 


€) 


eurosat 

►  Site 

►  Sear 


A  multi-award  winner,  the  Alexa  software  is  both  free  and  extremely  usefiil.  It 
works  more  or  less  as  a  browser's  torch-light. 

This  is  a  little  program  that  will  add  a  horizontal  bar  to  the  bottom  part  of  your  browser.  And  as 
you  browse,  it  points  the  way,  and  will  inform  you  of  the  popularity  of  the  site  you  currently 
visiting  without  any  bias,  as  well  as  other  useful  information.  All  on  real  time  and  hassle  free. 

Eurosat.com  is  one  of  the  first  satellite  TV  website  that  has  been  rated  Top  10000 

The  raw  data  is  collected  from  all  the  major  backbone  around  the  world,  hence  far  more  reliable 
than  other  dodgy  survey  services.  Such  as  Top  XXXX  etc.  Results  are  generally  very  accurate. 
Exception  is  when  certain  website  carries  far  too  many  tiny  pictures  on  each  page,  throughout 
the  entire  site.  As  each  little  GIF  or  JPG  translate  into  one  hit,  it  may  ultimately  effects  the  total 
outcome.  But  even  this  event  rarely  occur. 

If  in  doubt,  compares  the  rating  to  Alexa  visits  for  consistency.  Generally  speaking, 

Top  10  is  reserved  for  the  Goddess  in  the  world  such  as  AOL  and  Microsoft. 

Top  1 00  is  for  the  Internet  Royal  Families  (CNN,  abcnews). 

Top  1 000  is  for  The  Rich  and  Influential  (Demon,  Euronet,  Techweb). 

Top  10000  for  the  bests  in  its  field. 

Top  25000  for  excellent  well  design  websites, 

Top  100000  for  websites  that  begins  to  show  its  potential. 

So  what  are  you  waiting  for?  Download  it  now.  Their  website  is  waiting  for  you. 

If  you  are  getting  a  Low  Traffic  (meaning  less  than  Top  100000),  there  is  no  reason  to  be 
panicked.  You  do  only  if  you  have  already  invested  millions  and  years. 


►  Statis 

►  Web 
Hostin 

and 
Design 


This  web  site  does  not  offer  pirate  Sky  card  nor  Sky-related  software 

This  site  is  in  no  way  related  to  Eurosat  Distribution(UK)  or  Eurosat  magazine(ltaly) 

All  enquiries  regarding  the  web  site  should  be  directed  at  The  Webmaster 

©  defiant@eurosat.com  1997.  All  Rights  Reserved.  DISCLAIMER 
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As  we  went  from  conference  to  conference  in  1997,  we  encountered  the  same 
phenomenon  over  and  over,  -f-  Internet  startups  founded  in  1995  described 
themselves  as  "old."  And  most  of  them  that  were  still  alive  in  1997  had  changed 
their  business  model — several  times.  Despite  these  unpromising  conditions,  some  of 
them  were  actually  starting  to  make  money.  -Y'-  At  the  same  time,  erstwhile  titans  stum- 
bled— aside  from  Apple's  well-publicized  woes,  CompuServe  was  absorbed  into  AOL 
and  WorldCom,  Informix  and  Sybase  hit  walls  in  the  enterprise  market,  Silicon  Graphics 
saw  its  traditional  markets  eroded  and  its  brains  drained  by  startups,  and  so  on.  ~f 
The  titans  that  didn't  stumble,  led  by  Microsoft  and  Intel,  exhibited  a  phenomenal  lack 
of  complacency.  -Y^  All  this  being  so,  it  will  probably  come  as  little  surprise  that  there  are 
a  lot  of  people  on  this  year's  list  you've  probably  never  heard  of — founders  of  startups, 
one-person  businesses,  intrepid  tech-support  guys,  and  relative  unknowns  who  some- 
how make  the  industry  a  little  easier  to  deal  with.  Some  of  them  will  no  doubt  vanish 
into  peaceful  obscurity  by  next  year;  some  will  probably  live  to  be  Bill  Gates'  biggest 
headache  (or  next  acquisitiori).  -f-  Also  prominent  on  the  list  are  some  folks  we  haven't 
seen  for  a  while — in  many  cases  because  they've  been  building  their  next  startup 
after  the  last  one  gave  up  the  ghost,  and  the  new  one  is  doing  something  pretty  inter- 
esting, 'f  We'd  like  to  thank  our  readers  once  again  for  bringing  many  of  these  deserv- 
ing souls  to  our  attention.  We  also  encourage  you  to  send  us  email  at  editors@microtimes. 
com  to  reproach  us  about  the  equally  deserving  souls  we  may  have  missed! 
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for  major  developinoiu  strides  by  emerging 
Russian  software  development  companies. 
His  own  company  produces  award-winning 
software.  But,  his  major  accomplishments  are 
centered  on  promoting  other  small  Russian 
developers. 

"With  letters  of  recommendation  from 
Microsoft  and  Intel,  with  whom  he  works  close- 
ly, he  brings  companies  into  cooperative  agree- 
ments to  help  them  polish  their  business  prac- 
tices and  products  for  global  distribution.  His 
efforts  help  qualified  individuals  produce  an 
honorable  living  in  Russia's  moribund  economic 
atmosphere.  He  is  building  an  entire  business 
culture,  promoting  production,  innovation  and 
growth.  His  company  provides  an  avenue  to  the 
Western  world. 

"Anet  Software  is  one  of  the  most  established 
development  companies  in  Russia  Mr. 
Gorjushin  built  this  company  with  virtually  no 
outside  investment  and  no  personal  experience 
during  historic  turmoil  and  transition  while 
Russia  changed  from  Communist  to  a 
Democracy.  He  survived  through  the  tumul- 
tuous fall  of  Gorbachev's  Soviet  Union  and  the 
failure  of  Yeltsin's  Federation  of  former 
Communist  Republics.  Anet  thrives  now.  in  the 
constantly  changing  rules  of  the  new  Russia." 

We're  happy  to  join  in  saluting  Mr.  Gorjushin, 
and  highly  recommend  visits  to  his  Web  site. 

Andy  Grove 

Chairman  and  CEO 
Intel 

www.intel.com 
Frankly,  were  not  sure 
about  all  those  danc- 
ing engineers  in  neon 
space  suits,  but  they're 
an  unforgettable  icon 
for  Grove's  latest  mar- 
ket   push    into    the 

world  of  video,  from  conferencing  to  gaming 
That  in  turn  is  only  the  latest  indication  of  his 
tireless  quest  to  anticipate  and  capitalize  on 
emerging  markets.  Meanwhile,  back  in  the 
mainstream,  he  also  keeps  the  Pentium  family 
well  ahead  of  its  competition  as  the  silicon  plat- 
form of  choice. 

JeH  Hawkins 

Chief  Technical  Officer 
Palm  Computing 
http://palmpilot.3com.com/ 

After  years  of  struggling  with  the  assorted  prob- 
lems of  pen-based  computing,  something  that's 
been  occupying  him   since   the   GriDPad, 
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Hawkins  and  his  team  rinally  hit  the  sweet  spot 
with  the  Palm  Pilot.  The  little  PDA  was  a  block- 
buster in  1997,  bringing  new  credibility  to  a 
market  that's  been  longer  on  hype  than  func- 
tionality, and  spawning  an  entire  industry  of 
applications  and  add-ons. 

Frank  Huang 

Chairman  and  CEO 
UMAX  Data  Systems 
www.5upetmac.com 

While  Huang  has  long  kept  Taiwan-based 
^UJWIAX^a^'y  player  in  the  arena  of  PCs,  mother- 
boards, scanners,  and  other  devices,  his  most 
stunning  accomplishment  during  1997  was 
retaining  UMAX/SuperMac's  status  as  the  only 
worldwide  licensee  of  the  MacOS  at  a  time 
when  Maccloners  were  experiencing  a  Jobs- 
induced  bloodbath.  Not  only  is  UMAX  offering 
its  customers  OS  8-based  models,  Huang 
extracted  a  pledge  from  Apple  to  to  certify 
future  UMAX  products  and  include  MacOS  8  in 
all  new  UMAX  system  designs  (excluding  those 
based  on  CHRP)  through  July,  1998. 

Echoing  the  sentiments  ot  a  grateful  populace, 
Huan^  commented:  "Mac  clones  today  remain 
an  important  part  of  a  healthy  Macintosh  mar- 
ket, aiid  UMAX  is  committed  to  leading  this  vital 
market  segment  with  world-class  products  and 
world-class  partners  such  as  Apple.  UMAX  has 
always  had  a  strong  relationship  with  Apple.  We 
are  confident  that  we  will  maintain  this  relation- 
ship and  continue  to  contribute  to  the  advance- 
ment of  the  Macintosh  platform,  and  expand  the 
Macintosh  market  as  a  whole." 

Oave  Hughes 

Managing  Partner 
Old  Colorado  City  Communications 
www.oldcolo.com 

Unlicensed-spectrum  wireless's  biggest  evange- 
list, Hughes  made  excellent  progress  during 
1997  with  his  National  Science  Foundation 
grant  to  deploy  wireless  Internet  access  in 
Colorado's  San  Luis  Valley  The  service  is  up  and 
running,  delighting  rural  schools  and  discomfit- 
ing the  local  telcos  prepared  to  charge  through 
the  nose  for'similar  access. 

International  Brotherhood  of  Teamsters 

www.teamster.org 

Normally  the  reamsters  are  a  bit  far  afield  from 
the  high-tech  industry,  but  their  successful  strike 
against  UPS  helped  focus  attention  on  a  similar 
inequity  in  the  treatment  of  the  computer  indus- 
try's contract  workers.  We  hope  the  cautionary 
episode  resonates  in  the  pro|)er  quarters. 


Irwin  Jacobs 

Chairman  and  CEO 
Harvey  White 

President 
Qualcomm 
www.qualcomm.com 
Qualcomm's  co- 

founders  have  built  the 
San  Diego  company 
into  a  leader  in  wire- 
less communications,  from  satellites  to  cell- 
phones— to  say  nothing  of  the  various  Eudora 
email  packages.  Departing  a  bit  from  the  strictly 
technological,  they  also  entered  into  a  deal  with 
the  city  of  San  Diego  by  which  the  faciliry  hous- 
ing Chargers  and  Padres  games  will  henceforth 
be  known  as  Qualcomm  Stadium. 

Steve  Jobs 

Acting  CEO 

Apple  Computer 

www.apple.com 

"He  has  resurrected 

Apple  from  the  brink  of 

insignificance!" 

enthuses  one  reader. 

Well,  maybe.  It 
remains  to  be  seen  whether  founder  Jobs  has 
succeeded  in  reversing  Apple's  death  spiral, 
though  the  G3  machines  represent  a  much- 
needed  step  in  the  right  direction.  We  wish  him 
well,  particularly  since  a  Mac-less  world  is  too 
horrible  to  contemplate. 

Brewster  Kahle 

President  and  CEO 
Alexa  Internet 
www.alexa.com 
Having  sold  WAIS  to 
AOL,  Kahle  has  found 
himself  a   new   pro- 
|ect— a  Web  navigation 
tool  designed  to  learn 
from  its  users  and  pro- 
duce   affinity-based 
recomniciulations  of  where  to  go  next.  The 
Alexa  software  is  immeasurably  aided  in  this 
endeavor  by  one  of  Kahle's  ongoing  projects — 
archiving  the  entire  Internet. 

Stephen  Kahng 

Founder  and  CEO 
Power  Computing 
wivw.powercc.com 

rhe  unchallenged  winner  of  the  1997  No  Good 
Deed  Goes  Unpunished  Award,  Kahng  effective- 
ly kept  the  Macintosh  platform  alive  while 
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Jakob  Nielsen's  Alertbox  for  February  8,  1998: 

The  Reputation  Manager 

There  are  two  ways  for  users  to  know  which  of  the  two  million  websites  to  visit: 

•  Brand:  the  user  knows  that  a  site  usually  provides  certain  qualities;  if  the  user  likes  these 
qualities,  then  it  will  probably  be  worth  visiting  more  pages  on  that  site 

•  Reputation:  the  user  follows  the  advice  of  other  users  who  know  that  a  site  has  quality 

Branding  is  the  focus  of  most  current  Web  projects:  the  theory  is  that  building  a  powerful  brand  while 
the  Web  is  still  relatively  small  will  allow  a  site  to  be  profitable  when  the  Web  gets  larger.  A  example  of 
a  good  Web  brand  is  news.com:  when  you  want  to  know  what's  happening  in  the  information  industry, 
you  will  usually  be  able  to  find  recent  news  at  news.com. 

Even  though  brands  work  well  for  a  few,  large  sites,  they  are  not  a  good  mechanism  to  help  users  handle 
millions  of  sites.  On  the  contrary,  the  nature  of  the  Web  encourages  the  formation  of  many  smaller  sites, 
and  most  of  its  value  comes  from  such  specialized  sites.  Thus,  the  Web  needs  a  mechanism  for  making 
sense  of  overwhelming  diversity. 

Since  there  is  no  way  for  computers  to  automatically  measure  quality,  we  have  to  rely  on  human 
judgment  for  Web  quality  ratings.  The  reputation  manager  is  a  way  to  automate  the  processing  of  such 
human  judgments;  not  a  way  to  make  the  judgments  themselves.  In  other  words,  quality  needs  to 
become  an  explicit  attribute  of  Web  objects. 

My  vision  for  a  reputation  manager  involves  the  coordination  of  billions  of  individual  quality  judgments 
by  hundreds  of  millions  of  users.  Every  time  you  encounter  an  information  source  on  the  Internet,  your 
Web  client  software  will  present  you  with  an  opportunity  to  vote  on  its  quality.  Typically,  this  would  be 
done  by  adding  two  buttons  to  the  interface:  a  thumbs-up  button  and  a  thumbs-down  button.  A  neutral 
rating  would  be  given  by  doing  nothing  (since  we  want  to  minimize  overhead  in  the  user  interface),  but 
when  a  user  encounters  something  particularly  good,  he  or  she  would  hit  the  "good"  button.  Similarly, 
disappointing  services  would  be  punished  by  a  click  on  "bad." 

The  simplest  reputation  manager  would  compute  the  average  rating  for  each  information  source,  but 
more  advanced  services  would  use  ideas  from  collaborative  filtering  and  compute  different  ratings  for 
different  users.  Basically,  the  reputation  manager  would  find  other  users  whose  tastes  are  very  similar 
to  your  own  and  give  added  weight  to  these  users'  ratings.  Since  the  Web  will  have  half  a  billion  users 
in  five  years,  it  will  always  be  possible  to  find  other  users  who  match  your  interests,  no  matter  how 
obscure  they  are.  Thus,  the  reputation  manager  can  deal  differently  with  people  who  love  the  Spice  Girls 
and  people  who  don't. 

The  reputation  manager  will  collect  ratings  for  entire  websites,  for  individual  pages,  and  for  people  who 
contribute  comments  to  discussion  groups  or  chat  rooms.  The  resulting  reputation  can  be  used  to  direct 
users  to  sites  that  will  be  helpful  or  interesdng,  and  it  can  be  used  to  filter  out  the  less  valuable 
contributors  in  chat  rooms  and  online  discussions.  In  this  way,  the  reputation  manager  becomes  a  more 
powerful  version  of  the  bozo  filter. 

Reputation  management  will  be  especially  valuable  when  combined  with  micro-pavments:  once  you 
have  to  pay  for  clicks,  you  will  be  motivated  to  find  out  in  advance  whether  the  destination  website  is 
any  good. 

Implementing  Reputation  on  the  Web 
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Initially,  I  expect  reputation  managers  to  become  embedded  in  proxy  servers  for  large  corporations  and 
offered  as  a  value-added  service  by  larger  Internet  Service  Providers.  In  both  cases,  it  is  possible  to 
collect  data  about  the  behavior  of  a  large  number  of  users  at  a  single  point.  In  about  three  years 
reputation  management  will  become  an  Internet-wide  service  that  users  can  subscribe  to  by  paying  a 
small  micro-payment  for  each  recommendation. 

There  is  already  an  independent  reputation  manager  available  on  the  Internet:  Alexa  -  which 
unfortunately  is  a  browser  add-on  and  thus  not  fully  integrated  with  the  user's  Web  client  software  (of 
course,  right  now,  browsers  serve  as  a  weak  type  of  Web  access  clients).  The  most  relevant  features  of 
Alexa  relative  to  this  column  are: 

•  Reputation  statistics  for  most  sites  on  the  Web  showing  how  frequently  they  are  visited  and  how 
popular  they  are.  For  example,  Alexa's  page  about  www.useit.com  shows  how  many  of  Alexa's 
users  like  my  site  and  how  many  don't  like  it. 

•  Recommendation  links  to  good  other  sites  that  are  related  to  the  current  page. 

Even  without  any  fancy  statistics,  most  websites  could  benefit  from  explicit  use  of  quality  ratings  in 

their  interface.  A  simple  logfile  analysis  will  show  you  what  parts  of  your  site  are  the  most  popular,  and 
it  would  be  reasonable  to  give  these  pages  special  prominence  in  search  results  and  in  listings.  For 
example,  my  own  Hst  of  old  Alertbox  columns  highlights  those  columns  that  have  attracted  the  largest 
readership  in  the  past.  Doing  so  prevents  new  readers  from  being  overwhelmed  by  choices  and  allows 
them  to  focus  on  the  links  that  are  most  likely  to  be  of  interest.  A  site  with  an  even  larger  set  of  old 
material  could  provide  a  single  page  listing  nothing  but  its  top  hits. 

February  22:  Tracking  the  grov^h  of  a  site 
See  Also:  List  of  other  Alertbox  columns 
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Alexa 

Brewster  Kahle  invented  the  WAIS  (Wide  Area  Information  Server) 
system  and  then  went  on  to  start  WAIS  Inc,  an  electronic 
publishing  company.  Bruce  Gilliat  was  VP  of  sales  and  marketing 
at  WAIS  Inc. 

So?  Well,  they're  now  the  guys  behind  Alexa,  a  new  Net  service 
built  on  a  question  messrs  Kahle  and  Gilliat  asked  themselves. 

Q:"We  all  experience  the  gaps  in  navigating  and  finding 
information  on  the  Internet  as  it  is  used  today-frustrating  keyword 
searches  that  turn  up  hundreds  or  thousands  of  web  pages  and 
sites,  very  few  of  which  are  of  any  interest.  What  if  we,  as  a 
community  of  users,  could  effortlessly  pool  our  collective 
experience  and  add  human  intelligence  to  navigation?  What  if  we 
could  fill  in  those  gaps?  It  is  a  radical  concept ..." 


Their  answer  was,  of  course,  to  create  the  Alexa  service, 
Internet  software  product  that  learns  from  people". 


'the  first 


And,  "This  intelligent  navigation  service  provides  a  continuous 
source  of  relevant  recommendations  of  where  to  go  next  on  the 
Web,"  says  the  Kahle/Gilliat  team.  "Alexa  works  in  parallel  with, 
and  independently  of  Web  browsers,  and  is  displayed  as  a  toolbar 
on  the  user's  desktop.  The  more  people  use  Alexa,  the  more  we  all 
benefit." 

Where's  the  benefit  for  K&G?  In  their  bank  balances,  of  course. 
But  what  else  is  new?  Alexa  provides  context-sensitive  advertising 
in  its  *Where  to  Go  Next*  selections  and  the  blurb  says,  "While  you 
will  receive  advertisements  when  using  the  service,  the  ads  will  be 
relevant  to  what  you  are  viewing  and  in  some  cases  may  act  as 
additional  'Where  to  Go  Next'  selections." 

In  short,  you  could  be  forgiven 

for  cynically  thinking  that 

Alexa  just  might  be  yet 

another  push  media  app  thinly 

disguised  as  a  browser 

plugin.  But  for  all  of  that,  it's 

fairly  cool,  if  only  for  its  ability 

to  give  you  an  instant  snap  of 

sites  you  land  on  which 

subscribe  to  it.  And  even  if 

they're  not  members,  you  still 

get  a  little  info.  That's  us  on 

the  right ;  )  It  also  gives  you  a 

means  of  keeping  in  touch 

with  other  people  who're  hooked  up  to  Alexa  by  allowing  you  to 

send  and  receiving  messages. 


Usefulness: 


^^^  ^ 


Performance: 


User  Friendly: 


ndSTE 


Alexa  v1 .20 

iPkuGkAM^^^^^  If  you'd  like  us  to  review  your  software,  please  contact  Jon  and  we'll  be  glad 
^^^^S.  to  take  a  look. 
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Manning  the  Wires 


by  Ric  Manning 


ALEXA  BRINGS 

DEAD  WEBS  BACK  TO  LIFE 


Wte- 


im£J 
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rewster  Kahle  has  found  an  answer  to  those 


pesky  "404  File  Not  Found"  error  messages:  He 
simply  copies  the  entire  World  Wide  Web  to  his  hard  drive.  If  someone 
decides  to  take  a  site  or  an  individual  page  offline,  he's  still  got  a  copy. 

That  might  strike  you  as  a  fairly  harebrained  idea,  but  Kahle  has  been 
knocking  around  the  Internet  for  quite  a  while  and  he  swears  he  knows  what 
he's  doing. 

In  the  early  1980s,  Kahle  developed  WAIS  (Wide  Area  Information  Server), 
one  of  the  first  systems  for  publishing  and  indexing  information  on  the 
Internet.  WAIS  eventually  lost  out  to  HTML,  but  America  Online  was 
impressed  enough  that  it  paid  Kahle  several  million  dollars  for  the  WAIS 
technology.  Kahle  used  the  money  to  set  up  shop  in  San  Francisco's  Presidio 
and  create  a  company  and  a  product  called  Alexa  Internet  (w^v^w. alexa.com).  a 
new  web  navigation  system  that  Kahle  named  after  the  lost  library  of 
Alexandria. 

Every  few  months,  Alexa  takes  a  snapshot  of  the  public  portion  of  the  Web. 
By  Kahle's  count,  that's  about  640,000  web  sites  and  more  than  130  million 
pages,  plus  all  the  messages  posted  on  Usenet  newsgroups.  The  whole  thing 
is  stored  on  about  500  data  tapes  for  a  total  of  about  6  terabytes,  about  as 
much  data  as  you'd  find  in  a  large  city  library. 

With  the  whole  Web  on  tape,  Kahle  said  he  is  able  to  study  it  in  ways  that 
others  can't.  "We  have  a  deep  knowledge  of  what  the  Web  is  and  we  are 
trying  to  produce  tools  for  people  to  make  better  use  of  the  Net,"  he  said. 

The  first  of  those  tools  is  a  browser  add-on  called  Alexa  that  can  be 
dovmloaded  free  from  Alexa's  web  site.  The  program  creates  a  toolbar  that 
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helps  web  surfers  understand  where  they  are  and  where  they  might  want  to 
go.  Bring  up  a  web  page  with  Alexa  running  and  it  will  tell  you  who 
registered  the  domain  name  and  the  number  of  pages  at  the  site.  It  will  also 
rate  a  site's  speed,  quality  and  popularity. 

"This  is  not  a  tool  for  someone  who  just  wants  to  see  pretty  girls  dancing 
across  the  screen,"  Kahle  said.  "We  think  of  this  as  a  really  good  tool  for  web 
masters  and  information  professionals.  It's  for  people  who  are  going  to  a  lot 
of  different  sites  and  trying  to  make  sense  of  what  they  see.  It  will  tell  you 
who's  behind  this  web  site,  how  big  is  it  and  can  I  trust  it?" 

Kahle  said  the  system  uses  InterNIC  domain  registration  data  to  find  a  site's 
owners.  Traffic  estimates  are  based  on  counts  irom  proxy  caches  at  large 
network  access  points.  Alexa's  own  crawlers  measure  server  speed.  The 
quality  ratings  are  contributed  by  Alexa  users. 

The  Alexa  toolbar  will  also  bring  dead  web  page  back  to  life.  If  you  get  a 
"404"  error,  you  can  hit  Alexa's  archive  button  to  get  the  most  recent  version 
that  Alexa  captured.  An  example  is  the  web  site  created  by  the  Heaven's  Gate 
cult.  The  pages  are  no  longer  online,  but  Alexa  can  serve  them  from  its 
archive. 

Alexa  uses  a  robotic  system  to  fetch  tapes  from  its  archive,  so  retrieval 
sometimes  takes  several  minutes.  But  if  Alexa  has  the  page,  it  will  display  it 
with  all  accompanying  graphics. 

Kahle  said  Alexa  copies  only  material  that  is  offered  to  the  public.  The  Alexa 
indexing  agent  bypasses  pages  that  contain  a  robots.txt  file,  which  tells  web 
crawlers  that  they  don't  want  to  be  indexed. 

"We'll  respect  that,"  he  said.  "There's  even  an  explanation  of  how  to  use  the 
file  on  our  site." 

At  the  same  time,  web  masters  can  invite  Alexa  to  archive  their  sites  by 
filling  out  a  simple  form  on  the  Alexa  web  site. 

Bringing  web  pages  back  fi-om  the  dead  is  a  pretty  good  trick,  but  it  may  not 
be  Alexa's  best.  Using  anonymous  traffic  data  fi'om  some  network  access 
points,  Alexa  determines  where  surfers  go  when  they  leave  a  particular  web 
site.  Every  time  you  visit  a  site,  the  Alexa  toolbar  tells  you  what  paths  were 
followed  by  the  people  who  came  before  you. 

Check  in  at  the  White  House  site,  for  instance,  and  Alexa  will  might  point 
you  toward  the  sites  for  the  House  of  Representatives,  the  U.S.  Senate  and 
THOMAS,  the  congressional  information  site. 


Many  of  Alexa's  links  seem  obvious.  At 
Boardwatch  's  home  page,  Alexa  points  to  three 
sites  that  deal  with  bulletin  board  systems.  At 
www.msnbc.com.  for  instance,  Alexa  will  point 
you  to  other  TV  network  sites,  to  Comedy 
Central  and  MTV.  At  other  times,  Alexa  will 

suggest  sites  that  don't  have  clear  associations  or  sites  that  are  related,  but 

new  to  you. 


"There  are  communities  of  people  who  circle  around  information,"  Kahle 
said.  By  tracking  their  surfing  patterns,  Alexa  can  identify  groups  with 
common  interests. 
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And  Kahle  sees  Alexa  developing  more  sen/ices 
that  could  be  useful  to  web  masters.  For 
instance,  Alexa  may  be  able  to  determine  what 
kinds  of  people  use  a  particular  site. 

"A  lot  of  our  users  have  given  us  demographic 
profiles,"  he  said.  That  information  can  be  used 
to  match  web  sites  with  specific  groups  of 

people,  it  can  also  help  people  who  use  the  Web  have  a  better  understanding 

of  it. 


"The  search  engines  are  going  to  the  10  channels,"  said  Kahle.  "We  don't 
think  that's  what  the  Web  is  about.  We  think  it's  about  diversity.  People  who 
use  the  Web  go  to  a  phenomenal  number  of  servers.  They're  not  just  there  to 
get  the  weather  and  the  sports  scores." 

WEB  LUBE 

Could  your  site  use  a  quick  checkup?  The  web  mechanics  at  the  Web  Site 
Garage  (www^.websitegarage.com)  are  only  too  happy  to  critique  your  work. 

Type  in  your  URL  (or  someone  else's)  and  the  garage  will  deliver  a  five-point 
report  that  gauges  load  time,  looks  for  dead  links  and  spelling  errors,  and 
evaluates  your  HTML  design.  It  also  calculates  the  popularity  of  your  site  by 
counting  how  many  other  web  sites  link  to  yours. 

You  can  use  the  report  to  make  improvements  or  you  can  hire  the  garage's 
team  to  give  your  site  a  makeover. 
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additional  ways  to 
navigate  tlie  Web 


Now  a  new,  free  ier-.  ;ce  j.i:i  i;;\e 
'.  ou  vital  informatiijn  on  how  pop- 
ular a  Web  site  is  and  how  manv 
people  liked  and  disliked  their 
::Ueractions  with  ;:.  E\-en  better, 
you  can  jump  to  related  sites  without 
having  to  resort  to  a  ke>•^vord  search,  and 
vou  can  retrieve  old  W'eb  pages  that  have  dis- 


WiNDOws  USERS  ARE  increaslnglv ex- 
ploring the  World  Wide  Web/Pewer 
than  20  percent  of  Windows  PCs  don't 
have  Internet  access,  according  to  a  re- 
cent Microsoft  press  briefing.  But  look- 
ing at  a  Web  site  and  having  confidence  tha" 
the  site  is  legitimate  an.d  trustworrh'.  are  two 
different  things. 


appeared  t'roni  the  Net. 
Alexa  is  a  relati\-ely  small  free- 
ware download  (918KB  in  size), 
that  works  with  Netscape  Nr- 
igator  3.0  or  later  and  Internet 
Explorer  3.0  or  later  on  Windows  95 
and  Windows  NT.  (If  you  are  using  Windows 
93,  vou  need  a  ver'^ion  of  the  file  Kernel32.dll 
dated  later  than  Feb.  2,  1996.  You  can  di  un- 
load the  updated  version  from  Ale.xa's  site.) 


When  you  install  Alexa,  a  small  toolbar 
appears  at  the  bottom  of  your  screen  when 
your  browser  is  active.  WTiile  you're  viewing  a 
Web  site,  clicking  the  Where  .Am  I  button  on 
Alexa's  toolbar  reveals  a  wealth  of  information. 

Leading  the  display  is  a  ranking  of  the  num- 
ber of  hits  the  site  receives  based  on  an  analy- 
sis of  traffic  across  the  Internet  backbone. 

Even  more  useful  is  the  name,  address,  and 
phone  number  of  the  site's  owner. 

This  information  is  retrieved  from  the 
di^main  registrar  InterNTC  and  includes  the 
lePiJth  of  time  the  domain  has  been  in  ex!-- 
r,  r.ce.  Because  longe\itv  alone  is  not  a  good 
indicator  of  an  Internet  business'  reliability, 
.\!exa  collects  votes  from  its  users  on  whether 
they  liked  a  site.  Unfortunately,  you  can't 
learn  why  visitors  voted  as  they  did.  But  in 
one  case,  for  example,  a  particular  travel  ser- 
vice de\'eloped  a  large  number  of  "dislike" 
'.  iites  —  perhaps  a  hint  that  vou  should  con- 
sider a  different  service  for  your  travel  needs. 

'_lne  of  the  most  remarkable  features  of 
.\!exa  is  that  it  can  retrie\e  old  Web  pages  that 
b.jve  moved  or  changed.  When  you  see  the 
message  "404  Not  Found"  in  your  browser, 
you  click  the  Archive  of  the  Web  button  and 
Alexa  will  attempt  to  display  the  most  recent 
archived  version  of  that  VVeb  page. 

This  feature  grew  out  ot  the  Internet 
Archive  project,  an  effort  to  preserve  Web 
pages  for  posterity.  Officials  at  Alexa's  parent 
company,  San  Francisco-based  Alexa  Inter- 
net, say  it  holds  three  different  "snapshots"  of 
more  than  500,000  Web  sites  that  have  been 
collected  since  early  1996.  This  information 
occupies  8  terabytes  (8,000GB)  of  storage  — 
eqiii\:\!ent  to  .ill  of  the  data  in  all  of  the  tapes 
in  I  r\  pical  video  store. 


Alexa  uses  the  links  between  sites,  as  wen 
as  Its  analysis  of  how  users  jump  from  site  to 
site,  to  determine  other  sites  that  may  be  rel- 
e\  ant  to  the  site  you're  currently  viewing. 
Clicking  Alexa's  Where  to  Go  Next  button 
lists  these  suggestions.  The  links  aren't  always 
pertinent,  but  they're  a  start.  Alexa  supports 
itself  with  small  advertisements  in  this  box. 

You  don't  even  need  Alexa  to  use  some 
ot  its  data.  For  e.xample,  set  your  browser  to 
http://widener.alexa.com/sitedata/yahoo 
.com  to  see  contact  and  rating  information 
on  the  Yahoo  search  engine  site.  Change 
yahoo.com  to  any  site  you  wish. 

But  the  download  is  well  worth  it.  Go  to 
http://www.alexa.com/download.  I  would 
like  to  thank  Danny  Sullivan  at  http:  search 
engine.com  for  this  suggestion. 


Biiiin  Livini^iton  is  the  co-author  of  several 
beit-selling  Windows  books,  inchiding  the 
most  recent  Windows  95  Secrets  (IDG 
Books).  Send  tips  to  bria)i_Uvingston& 
nit  'world. com.  He  regrets  that  he  cannot 
an-wer  individual  questions. 
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Does  Yahoo  Still  Yahoo? 

by  Chris  Oakes 

3:43pm   1 1.Feb.98.PST 

Yahoo  may  soon  have  to  admit  that  the  very 
service  that  made  it  the  Web's  Number  One  site 
is  inherently  flawed.  As  Web  grov^^th  explodes, 
critics  charge,  up  to  a  third  of  the  sites  seeking  a 
listing  in  the  popular  directory  don't  get  in. 

Even  Yahoo  has  long,  if  quietly,  admitted  that 
some  sites  may  take  months,  even  years,  to  get 
listed  at  all.  And  the  impact  can  be  dramatic,  as  a 
listing  in  Yahoo  can  mean  feast  or  famine  when  It 
comes  to  attracting  browsers  to  a  Web  site. 

Just  as  it  reaches  new  levels  of  popularity  and 
business  success  -  the  Web's  foremost  directory 
service  may  soon  be  unable  to  call  itself 
anywhere  near  up-to-date. 

"They're  getting  out  of  the  [directory]  business, 
and  they  should  be  honest  about  that,"  said  Louis 
Rosenfeld,  a  man  who  has  had  personal 
experience  with  the  difficulty  of  getting  a  site 
listed  in  Yahoo. 

In  fact,  unexpected  reinforcement  of  his  charge 
can  be  found  in  Yahoo's  most  recent  published 
annual  report  of  1996,  which.  In  outlining  the 
threats  to  Its  business,  reads  "the  Company  has 
from  time  to  time  experienced  significant  delays 
in  the  processing  of  submissions,  and  further 
delays  could  have  a  material  adverse  effect  on 
the  Company's  goodwill  among  Web  users  and 
content  providers,  and  on  the  Company's 
business." 

"Why  Is  this  not  a  problem  anymore?"  asks 
Danny  Sullivan,  editor  of  Search  Engine  Watch. 
which  keeps  an  eye.on  the  doings  of  Web 
directories  and  search  engines. 

Submitter  Rosenfeld  chronicled  ongoing 
problems  with  his  Yahoo  listings.  While  he  did  get 
his  site  listed  years  ago,  his  efforts  to  make 
corrections  when  his  listings  became  outdated  or 
Incorrect  were  exercises  in  frustration. 

"I've  had  terrible  problems  with  the  same  URLs 
going  back  for  three  years,"  he  said.  After  waging 
a  "campaign  to  reach  a  human"  at  Yahoo,  he 
found  his  contact  very  friendly  and  responsive  - 
but  the  fixes  they  made  went  only  halfway.  Today, 
half  his  listings  remain  incorrect,  and  after  his 
initial  go-round,  he  sees  their  complete  correction 
as  all  but  hopeless. 

Something  of  a  Yahoo  competitor,  Rosenfeld  in 
fact  runs  his  own  service  aimed  at  specialized, 
rather  than  generalized,  cataloging  of  topical  sites 
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^^  In  their  office  at  S.F.'s  Presidio,  Alexa  co-founders  Brewster  Kahle  (left)  and  Bruce  Gilliat  stood  amid  their  machinery. 

"  Archiving  the  Internet 
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Brewster  Kahle 
makes  digital 
snapshots  of  Web 


By  Carolyn  Said 
ChroRicle  Technology  Editor 


rewsier  Kanie  is  creating  the 
Internet  equivalent  of  the  Li- 
brary of  Congress. 
The  37-year-old  programmer 
and  entrepreneur  has  been  capturing 
and  archiving  every  public  Web  page 
since  1996. 

His  nonprofit  Internet  Archive 
serves  as  a  historical  record  of  cyber- 
space. His  for-profit  company,  Alexa  In- 
ternet, uses  the  archive  as  part  of  an  in- 
novative search  tool  that  lets  users  call 
up  "out-of-print"  Web  pages. 


2  Mtnths  t>  Coptttf 

From  a  100-year-old,  red-roofed  of- 
fice in  the  Presidio,  Alexa's  32  employ- 
ees send  out  computer  programs  that 
crawl  the  Internet  to  find  and  download 
Web  pages.  It  takes  about  two  months  to 
capture  the  entire  Web  —  currently 
some  300  million  pages. 

Along  with  the  actual  pages,  the  pro- 
grams retrieve  and  store  "metadata"  — 
information  about  each  site,  such  as  how 
many  people  visited  it,  where  on  the 
Web  they  went  next  and  what  other  pag- 
es are  linked  to  it. 

The  Web  pages  are  stored  digitally 
on  a  "jukebox"  tape  drive  the  size  of  two 
soda  machines.  It  contains  10  terabytes 
of  data  —  as  much  information  as  one- 
half  the  entire  Library  of  Congress. 

Like  that  institution,  the  Internet  Ar- 
chive doesn't  exclude  information  be- 
cause it's  trivial,  dull  or  just  plain  weird. 


AVirtMclUbrary 

"Of  course,  we've  got  more  pictures 
of  Cindy  Crawford  than  the  Library  of 
Congress  does,"  said  Kahle.  But  to  create 
an  accurate  portrayal  of  our  life  and 
times,  it's  necessary  "to  capture  all  the 
dreck  you  could  ever  want." 

Having  created  a  virtual  library,  the 
next  step  was  to  make  a  better  card  cata- 
log. So  Kahle  and  partner  Bruce  Gilliat 
started  Alexa,  named  after  the  ancient 
Library  of  Alexandria. 

Alexa's  search  engine  uses  the  Ar- 


chive's  metadata  to  help  users  find  infor- 
mation based  on  the  trails  of  other  Inter- 
net surfers. 

The  search  engine,  available  for  free 
at  wivw.alexa.com,  is  a  toolbar  that  sits 
along  the  bottom  of  a  Web  browser.  It 
looks  at  the  site  a  user  is  currently  view- 
ing and  suggests  other  pages  by  analyz- 
ing where  previous  visitors  to  that  site 
went  next. 

OlJSItgttcVicw 

What  separates  Alexa  from  other 
search  engines  is  that  it  lets  users  view 
sites  that  have  been  removed  from  the 
Web. 

When  they  encounter  the  message 
"404  Document  Not  Found"  users  can 
click  on  the  Alexa  toolbar  to  fetch  the 
out  of  print  Web  page  from  the  Internet 
Archive. 

Alexa  is  supported  by  advertising, 
but  even  the  ads  relate  to  users'  inter- 


"Of  course y  we've  got 
more  pictures  of  Cindy 
Crawford  than  the 
Library  of  Congress 
does.'* 

—  BREWSTER  KAHLE,  Alexa  co-founder 


ests.  A  visitor  to  the  Amazon.com  Web 
site  might  see  a  Barnes  &  Noble  ad. 

"Clearly  we  need  better  tools  for  ex- 
ploring the  Web,"  says  Peter  Lyman, 
head  librarian  for  the  University  of  Cali- 
fornia at  Berkeley  and  an  Internet  Ar- 
chive board  member.  "Alexa  is  trying  to 
help  us  find  our  way  out  of  the  forest  by 
looking  for  trails  where  previous  people 
have  gone.  It's  the  most  promising  idea 
about  how  we'll  search  the  Internet  in 
the  future." 

Crcndcr  Pt<iH$ 

Available  since  September,  Alexa  al- 
ready has  100,000  users  but  Kahle  has 
grander  plans  for  it. 

"Our  goal  is  to  make  this  part  of  the 
infrastructure  of  the  Internet,"  he  said. 

One  surefire  way  to  achieve  that  sta- 
tus would  be  to  sell  Alexa  to  a  browser 
company,  a  search  engine  company  or  a 
major  Internet  service  provider  —  any 
of  which  might  be  a  possibility,  Kahle 
said. 


Browser  and  search  firms  are  snap- 
ping up  technology  that  improves  Web 
navigation.  Search  company  Lycos  last 
week  spent  $39.75  million  for  Wise  Wire, 
which  automatically  organizes  Internet 
content  into  directories  and  categories. 
Last  month  Microsoft  shelled  out  a  re- 
ported $40  million  for  Firefly,  which  rec- 
ommends content  to  Web  surfers  based 
on  profiles  they  submit. 

Kahle  already  has  a  track  record  of 
creating  next-step  Internet  technology. 
In  the  early  1990s,  he  developed  the 
Wide  Area  Information  Server  (WAIS), 
the  first  system  for  publishing  quanti- 
ties of  data  in  a  searchable  form  on  the 
Internet. 

Impressive  Background 

The  New  York  Times,  Wall  Street 
Journal  and  Encyclopaedia  Britannica 
were  among  its  customers.  Kahle  later 
sold  WAIS  to  America  Online  for  $15  mil 
lion  in  1995. 

Besides  an  impressive  programming 
background,  which  includes  a  degree 
from  the  Massachusetts  Institute  of 
Technology  and  a  stint  designing  super- 
computers at  Thinking  Machines  Corp., 
Kahle  has  an  abiding  interest  in  tradi- 
tional media. 

His  hobby  is  letterpress  printing. 
Painstakingly  aligning  individual  lead 
letters  by  hand  to  make  cards  and  docu- 
ments is  a  far  cry  from  computer  auto- 
mation, "but  that's  the  charm,"  he  said. 

His  wife,  Mary  Austin,  is  the  founder 
and  curator  of  the  San  Francisco  Center 
for  the  Book,  which  runs  programs  and 
classes  to  encourage  "all  arts  of  the  visi- 
ble word." 

Type  Pestgner's  Legocy 

They  named  their  SVz-year-old  son 
Caslon  after  an  18th  century  type  de- 
signer. Their  9-month-old  son  Logan  has 
a  family  name. 

"When  the  printing  press  came 
about,  it  fostered  thousands  of  tiny 
presses  all  over  the  globe,  allowing  peo- 
ple in  small  towns  to  publish  and  distrib- 
ute information.  That's  what  we're  find- 
ing here  on  the  Web,"  he  said. 

"As  we  move  human  knowledge 
from  paper  to  computers,  people  are  get- 
ting access  to  huge  amounts  of  informa- 
tion more  easily.  But  to  help  organize 
the  Web  we  have  to  track  what's  on  it 
and  what's  going  on  over  time." 
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In  Attempting  to  Archive  the  Entire 
Internet,  a  Scientist  Develops  a  New  Way  to 
Search  It 

Non-profit  group  uses  'data  mining'  in  effort  to 
preserve  World-Wide  Web  content 


By  JEFFREY  SELINGO 


SAN  FRANCISCO 


For  110  years,  Building  116  served  unobtrusively  as  a 
general  store  and  as  quarters  for  non-commissioned 
officers  stationed  at  the  Presidio,  the  U.S.  Army  base 
south  of  the  Golden  Gate.  Now  that  the  base  has  been 
decommissioned.  Building  116  stands  out. 

Its  red-tile  roof  and  cream-colored  shiplap  siding  have 
been  restored,  for  one  thing.  And  it's  one  of  the  few 
buildings  still  in  use  on  the  1,480-acre  property,  which 
became  a  national  recreation  area  in  1995.  What  makes 
Building  1 16  unique,  though,  is  what's  inside  ~  a  virtual 
preservation  project  that  aims  to  create  a  complete 
archive  of  the  Internet. 

In  a  back  room,  a  buzz  comes  from  giant  computers  that 
are  "data  mining"  the  World-Wide  Web  and  Usenet 
discussion  groups,  finding  and  recording  pages  of  the 
computer  network  on  a  nearby  digital-tape  machine. 
The  computers  take  complete  snapshots  of  the  Web 
every  two  months,  allowing  users  to  find  pages  long 
after  their  owners  have  taken  them  down  and  let  their 
hyperlinks  lapse.  So  far,  the  archive  has  compiled  eight 
terabytes  of  data  -  the  equivalent  of  800,000  books  - 
and  has  recorded  at  least  three  snapshots  of  more  than 
500,000  Web  sites. 

Inside  those  virtual  books  is  the  patchwork  history  of 
ordinary  people:  pages  of  college  students  long 
graduated;  Web  sites  of  political  campaigns  since 
forgotten;  early,  awkward  versions  of  sites  that  are  now 
well-known;  infamous  sites  that  held  our  attention  for 
weeks,  such  as  the  Heaven's  Gate  cult's  page. 


ALSO  SEE: 

The  World-Wide  Web  site 
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Brewster  Kahle  started  the 
non-profit  Internet  Archive 
after  selling  his  previous 
venture,  the  Wide  Area 
Information  Server,  to 
America  Online  for 
$15-million. 


Just  as  the  Internet  has 
allowed  all  kinds  of 
ordinary  people  to  become 
their  own  publishers,  it  has 
allowed  a  computer 
scientist  named  Brewster 
Kahle  {left)  to  create  the 
non-profit  Internet  Archive. 

Mr.  Kahle  decided  to  save 
the  Internet's  contents  for 
posterity  after  selling  his 
previous  venture,  the  Wide 
Area  Information  Server,  to 
America  Online  for 
$15-minion.  The  system, 
which  he  invented,  makes  it 
easier  to  search  electronic 
data  bases. 


Why  an  archive?  "We  need  to  preserve  this  heritage," 
says  Mr.  Kahle,  an  affable  and  enthusiastic  37-year-old 
who  is  a  graduate  of  the  Massachusetts  Institute  of 
Technology.  "Or  one  day,  digital  anthropologists  will 
wonder  if  we  ever  learned  anything  from  the  history  of 
other  inventions.  Remember,  nobody  recorded 
television  in  the  early  days." 

Once  he  started  collecting  all  that  information,  Mr. 
Kahle  says,  he  realized  how  difficult  finding  things  on 
line  was  becoming,  with  the  number  of  Web  sites 
doubling  every  six  months  even  as  other  material  falls 
into  neglect.  So  he  set  about  creating  a  Web  search 
engine  using  the  technology  he  developed  to  manage 
the  massive  amounts  of  data  he  was  collecting  for  his 
quirky  history  project. 

The  result  is  Alexa,  a  search  engine  operated  by  Alexa 
Internet,  the  for-profit  company  that  is  part  of  the 
Internet  Archive.  "This  will  change  the  way  that 
researchers  use  the  Internet,"  Mr.  Kahle  says. 

Alexa  is  software  that  can  be  retrieved  free  from  the 
company's  Web  site  (http://www.alexa.com)  and  added 
to  a  Web  browser.  Unlike  other  search  engines,  such  as 
Yahoo!  and  Excite,  it  doesn't  rely  on  word  searches. 
Instead,  it  watches  where  its  users  go  on  the  Internet, 
and  then  records  that  information  in  a  central  data  base. 
Based  on  that  information,  Alexa  can  tell  a  user  the 
most  popular  paths  that  other  Alexa  users  have  taken 
from  the  site  the  user  is  visiting  at  a  given  time. 

It  also  can  suggest  other  sites  offering  related  material. 
The  top  1 0  sites  pop  up  in  a  thin,  gray  bar  near  the 
browser  {see  below)  and  change  as  the  user  moves  from 
page  to  page. 
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For  example,  from  the  "Perseus  Project" 
(http://www.perseus.tufts.edu).  a  site  with  an  extensive 
collection  of  ancient  Greek  texts  in  translation,  Alexa 
points  the  user  to  sites  about  classicists  and 
Mediterranean  archaeologists  at  the  University  of 
Michigan,  sites  about  publishers  and  journals  available 
electronically,  sites  about  Hellenistic  linguistics,  and  to 
"Project  Gutenberg,"  an  Internet  producer  of  free 
electronic  texts. 

Since  October,  more  than  200,000  people  have 
downloaded  Alexa.  The  service,  which  Mr.  Kahle  hopes 
will  soon  be  frilly  supported  by  advertising  revenue,  is 
not  yet  turning  a  profit.  But  by  the  end  of  the  year,  he 
expects  it  to  have  a  million  users.  Eventually,  he  hopes 
to  use  the  profits  from  Alexa  to  finance  the  gathering  of 
data  for  the  archive. 

The  advantage  of  Alexa  as  a  search  engine  is  that  it 
"attempts  to  be  an  objective  source"  for  people  seeking 
information.  Where  conventional  hnks  are  chosen  by  a 
page's  creator  according  to  what  the  creator  knows  and 
prefers,  Alexa  also  brings  other  Web  users'  knowledge 
and  preferences  to  bear.  The  sites  recommended  in  a 
given  search  sometimes  change,  depending  on  the 
surfing  patterns  of  Alexa  users. 

The  system  has  its  oddities.  If  users  frequently  traveled 
from  the  "Perseus  Project"  to,  say.  The  New  York  Times, 
the  newspaper  could  be  added  to  the  top- 10  list  of  an 
Alexa  user  looking  at  the  Perseus  site,  even  though  the 
only  thing  the  two  sites  have  in  common  is  their  users. 
In  fact,  such  a  situation  has  already  occurred.  From  the 
Perseus  site,  Alexa  suggests  --  based  on  other  users' 
habits  --  visits  to  the  sites  of  Franklin  and  Marshall 
College  and  Bates  College.  Alexa  officials  say  students 
at  the  two  colleges  probably  use  the  Perseus  site  in  their 
classes. 

"It's  sometimes  random  and  not  always  perfect,"  Mr. 
Kahle  says.  "But  if  researchers  use  a  traditional  search 
engine,  they  may  miss  some  of  the  best  sites."  A  search 
engine  such  as  Excite,  using  the  keywords  "Greek  texts" 
to  find  sites  related  to  the  "Perseus  Project,"  turned  up 
268,057  matches.  "With  Alexa,  you're  bound  to  hit  at 
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least  some  of  the  top  sites,"  Mr.  Kahle  adds. 

Still,  one  needs  a  traditional  search  engine  or  a  specific 
Web  address  to  get  started,  Mr.  Kahle  acknowledges. 
And  Alexa,  unlike  the  Alta  Vista  search  engine  and 
others,  can  suggest  linking  only  to  entire  Web  sites,  not 
to  specific  pages  within  them. 

"I  don't  think  of  it  in  the  same  way  as  a  search  engine  — 
it's  a  supplement,"  says  Bruce  Livett,  a  reader  and 
deputy  head  of  the  biochemistry  and  molecular-biology 
department  at  the  University  of  Melbourne,  in  Australia. 
"Alexa  gives  you  relevant  sites  in  the  general  sense, 
sites  that  you  sometimes  miss  because  other  search 
engines  depend  on  specific  keywords  you  enter." 

Dr.  Livett,  who  has  been  using  Alexa  since  October, 
surfs  the  Web  to  keep  up  with  the  research  work  of 
colleagues  around  the  world.  "It's  competitive  work,  and 
I  need  to  know  what  they're  doing."  Alexa,  he  says,  has 
alerted  him  to  research  sites  that  did  not  turn  up  in 
searches  using  Excite  and  Anzwers,  a  search  engine 
designed  for  Web  users  in  Australia  and  New  Zealand. 

Part  of  Alexa's  appeal,  he  says,  is  access  to  the  Internet 
Archive.  When  Alexa  users  get  a  dreaded  "404  --  file 
not  found"  error,  they  can  click  on  a  button  on  Alexa's 
tool  bar  and  pull  up  the  missing  page  from  the  archive. 
Using  the  archive,  Mr.  Livett  found  an  audio  interview 
he  needed  that  had  been  removed  from  a  Web  site. 

Alexa  is  "an  immediate  use  for  the  archive,"  says  Mr. 
Kahle,  adding  that  the  archive  is  the  component  that  he 
expects  will  eventually  separate  his  search  engine  from 
the  rest  of  the  pack.  Alexa  also  offers  a  direct  link  to  the 
Encyclopaedia  Bhtannica  Web  site,  allowing  users  to 
retrieve  reference  information  without  leaving  the  Web 
page  they  are  viewing  at  the  time. 

As  more  people  begin  to  use  Alexa  and  the  archive, 
however,  tricky  questions  about  copyright  and  privacy 
have  begun  to  crop  up.  An  e-mail  discussion  list  for 
Web  publishers  recently  included  a  heated  debate  about 
copyright  issues  surrounding  old  newspaper  articles  that 
are  part  of  the  archive. 

The  data-mining  computers  skip  Web  pages  that  require 
passwords,  as  well  as  Web  sites  protected  by  the 
Standard  for  Robot  Exclusion,  which  blocks  search 
engines  from  copying  pages  or  directories.  Still,  some 
Web  publishers  said  in  the  e-mail  discussion  that  Alexa 
officials  should  be  asking  on-line  newspapers  and 
journals  if  they  want  to  be  part  of  the  archive,  instead  of 
forcing  them  to  block  Alexa  from  copying  pages. 

How  the  archive  will  be  used  in  the  long  term  is  not 
clear.  Mr.  Kahle  often  mentions  the  early  days  of 
television,  v  'ten  programs  were  broadcast  live  and 
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recording  technology  was  primitive.  "When  it  comes  to 
a  point  where  users  have  a  camcorder  recording  the  Net, 
then  the  archive  won't  be  worth  it,"  he  says. 

In  Building  1 16,  the  archive  is  stored  in  a  digital-tape 
library  that  looks  like  a  vending  machine.  The  tapes 
currently  have  the  capacity  to  hold  20  terabytes  of  data 
in  all,  about  as  much  information  as  is  in  the  Library  of 
Congress.  So  much  content  is  being  added  to  the 
Internet  that  the  archive  grows  by  about  a  terabyte  of 
data  each  month.  The  data-mining  computers  are  able  to 
adjust  their  site  visits  to  concentrate  on  those  that 
change  most  frequently.  They  will  come  upon  a  site, 
however,  only  if  Alexa  users  have  visited  it,  if  anyone 
else  on  the  Web  has  linked  to  it,  or  if  it  is  listed  with  a 
directory  service. 

While  the  archive  has  been  able  to  keep  up  with  textual 
information  on  the  Internet  ~  it  is  complete  from 
October  1996  to  the  present  —  the  effort  to  collect 
images  is  running  a  few  months  behind. 

Through  Alexa,  the  archive  receives  about  14  requests 
for  old  pages  every  second.  Not  bad,  its  founder  says, 
when  one  considers  that  only  about  9,000  people  visit 
the  San  Francisco  Public  Library  on  an  average  day. 
Alexa  also  helps  researchers  by  listing  facts  about  the 
site  they're  visiting:  the  address  of  the  individual, 
company,  or  other  organization  that  owns  the  server  on 
which  the  site  is  located;  how  many  people  have  visited 
the  site;  how  frequently  the  site  is  updated;  how  fast  its 
computers  are;  and  how  many  pages  the  site  contains. 
Alexa  also  allows  users  to  vote  for  their  favorite  sites 
and  keeps  a  running  total  on  each  site. 

Mr.  Kahle  says  Alexa  does  not  keep  individual  statistics 
on  its  users.  Although  the  search  engine  tracks  the  paths 
of  users  as  they  jump  from  site  to  site,  it  does  not  record 
users'  names.  "We  don't  care  who  you  are,"  he  says. 
"We  just  care  what  path  you  take." 

Mr.  Kahle  dreams  that  Alexa  could  become  as  popular 
—  and  as  profitable  ~  as  search  engines  like  Yahoo!  and 
Excite.  And  the  Internet  Archive,  he  says  with 
enthusiasm,  could  become  part  of  a  large  research 
library,  although  he's  not  sure  how.  "I  don't  think  about 
the  details,"  he  says.  "That's  why  we're  doing  something 
now  that  others  thought  was  impossible,  or  even  crazy." 
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Information  Technology 


A  New  Archive  and  Internet  Search  Engine 
May  Change  the  Nature  of  On-Line  Research 

Non-profit  group  uses  'data  mining'  in  attempt  to  preserve  World-Wide  Web  content 


BY  JEFFREY  SELINCO 

SAN  FRANCISCO 

FOR  no  YEARS,  Building  116  served 
unobtrusively  as  a  general  store  and 
as  quarters  for  non-commissioned 
officers  stationed  at  the  Presidio,  the  U.S. 
Army  base  south  of  the  Golden  Gate.  Now 
that  the  base  has  been  decommissioned. 
Building  1 16  stands  out. 

Its  red-tile  roof  and  cream-colored  ship- 
lap  siding  have  been  restored,  for  one 
thing.  And  it's  one  of  the  few  buildings  still 
in  use  on  the  1,480-acre  property,  which 
became  a  national  recreation  area  in  1995. 
What  makes  Building  1 16  unique,  though, 
is  what's  inside^a  virtual  preservation 
project  that  aims  to  create  a  complete  ar- 
chive of  the  Internet. 

In  a  back  room,  a  buzz  comes  from  giant 
computers  that  are  "data  mining"  the 
World-Wide  Web  and  Usenet  discussion 
groups,  findmg  and  recording  pages  of  the 
computer  network  on  a  nearby  digital-tape 
machine.  The  computers  take  complete 
snapshots  of  the  Web  every  two  months, 
allowing  users  to  find  pages  long  after  their 
owners  have  taken  them  down  and  let  their 
hyperlinks  lapse.  So  far,  the  archive  has 
compiled  eight  terabytes  of  data— the 
equivalent  of  800,000  books— and  has  re- 
corded at  least  three  snapshots  of  more 
than  500,000  Web  sites. 


BrewstL'r  Kahic  started  the  non-profit  Internet  Arehive  afh  i  s(  //  n^  hi\  nit  i  loin 
venture,  the  Wide  Areii  Infornuilion  Server,  to  Anieriea  OnUne  for  $1 5-nullion. 


DOUBLING  EVERY  6  MONTHS 

Inside  those  virtual  books  is  the  patch- 
work history  of  ordinary  people:  pages  of 
college  students  long  graduated;  Web  sites 
of  political  campaigns  since  forgotten;  ear- 
ly, awkward  versions  of  sites  that  are  now 
well-known;  infamous  sites  that  held  our 
attention  for  weeks,  such  as  the  Heaven's 
Gate  cult's  page. 

Just  as  the  Internet  has  allowed  all  kinds 
of  ordinary  people  to  become  their  own 
publishers,  it  has  allowed  a  computer  sci- 
entist named  Brewster  Kahle  to  create  the 
non-profit  Internet  Archive. 

Mr.  Kahle  decided  to  save  the  Internet's 
contents  for  posterity  after  selling  his 
previous  venture,  the  Wide  Area  Informa- 
tion Server,  to  America  Online  for  $15- 
million.  The  system,  which  he  invented, 
makes  it  easier  to  search  electronic  data 
bases. 

Why  an  archive?  "We  need  to  preserve 
this  heritage,"  says  Mr.  Kahle,  an  affable 
and  enthusiastic  37-year-old  who  is  a  grad- 
uate of  the  Massachusetts  Institute  of 
Technology.  "Or  one  day,  digital  anthro- 
pologists will  wonder  if  we  ever  learned 
anything  from  the  history  of  other  inven- 
tions. Remember,  nobody  recorded  televi- 
sion in  the  early  days." 

Once  he  started  collecting  all  that  infor- 
mation, Mr.  Kahle  says,  he  realized  how 
difficult  finding  things  on  line  was  becom- 
ing, with  the  number  of  Web  sites  doubling 
every  six  months  even  as  other  material 
falls  into  neglect.  So  he  set  about  creating  a 
Web  search  engine  using  the  technology  he 
developed  to  manage  the  massive  amounts 
of  data  he  was  collecting  for  his  quirky 
history  project. 

The  result  is  Alexa,  a  search  engine  op- 
erated by  Alexa  Internet,  the  for-profit 
company  that  is  part  of  the  Internet  Ar- 
chive. "This  will  change  the  way  that  re- 
searchers use  the  Internet,"  Mr.  Kahle 
says. 


THE  MOST  POPULAR  PATHS 

Alexa  is  software  that  can  be  retrieved 
free  from  the  company's  Web  site  (http:// 
www.alexa.com)  and  added  to  a  Web 
browser.  Unlike  other  search  engines, 
such  as  Yahoo!  and  Excite,  it  doesn't  rely 
on  word  searches.  Instead,  it  watches 
where  its  users  go  on  the  Internet,  and  then 
records  that  information  in  a  central  data 
base.  Based  on  that  information,  Alexa  can 
tell  a  user  the  most  popular  paths  that  other 
Alexa  users  have  taken  from  the  site  the 
user  is  visiting  at  a  given  time. 

It  also  can  suggest  other  sites  offering 
related  material.  The  top  10  sites  pop  up  in 
a  thin,  gray  bar  at  the  bottom  of  the  brows- 
er and  change  as  the  user  moves  from  page 
to  page. 

For  example,  from  the  "Perseus  Proj- 
ect" (http://www.perseus.tufts.edu),  a  site 
with  an  extensive  collection  of  ancient 
Greek  texts  in  translation,  Alexa  points  the 
user  to  sites  about  classicists  and  Mediter- 
ranean archaeologists  at  the  University  of 
Michigan,  sites  about  publishers  and  jour- 
nals available  electronically,  sites  about 
Hellenistic  linguistics,  and  to  "Project  Gu- 
tenberg," an  Internet  producer  of  free 
electronic  texts. 

Since  October,  more  than  200,000  peo- 
ple have  downloaded  Alexa.  The  service, 
which  Mr.  Kahle  hopes  will  soon  be  fully 
supported  by  advertising  revenue,  is  not 
yet  turning  a  profit.  But  by  the  end  of  the 
year,  he  expects  it  to  have  a  million  users. 
Eventually,  he  hopes  to  use  the  profits 


from  Alexa  to  finance  the  gathering 
of  data  for  the  archive. 

'OBJECTIVE  source' 

The  advantage  of  Alexa  as  a 
search  engine  is  that  it  "attempts  to 
be  an  objective  source"  for  people 
seeking  information.  Where  con- 
ventional links  are  chosen  by  a 
page's  creator  according  to  what 
the  creator  knows  and  prefers, 
Alexa  also  brings  other  Web  users' 
knowledge  and  preferences  to 
bear.  The  sites  recommended  in  a 
given  search  sometimes  change, 
depending  on  the  surfing  patterns 
of  Alexa  users. 

The  system  has  its  oddities.  If 
users  frequently  traveled  from  the 
"Perseus  Project"  to,  say.  The 
New  York  Times,  the  newspaper 
could  be  added  to  the  top-IO  list  of 
an  Alexa  user  looking  at  the  Per- 
seus site,  even  though  the  only 
thing  the  two  sites  have  in  common 
is  their  users.  In  fact,  such  a  situa- 
tion has  already  occurred.  From 
the  Perseus  site,  Alexa  suggests — 
based  on  other  users'  habits — vis- 
its to  the  sites  of  Franklin  and  Mar- 
shall College  and  Bates  College. 
Alexa  officials  say  students  at  the 
two  colleges  probably  use  the  Per- 
seus site  in  their  classes. 

"It's  sometimes  random  and  not 
always  perfect,"  Mr.  Kahle  says. 
"But  if  researchers  use  a  tradition- 
al search  engine,  they  may  miss 
some  of  the  best  sites."  A  search 
engine  such  as  Excite,  using  the 
keywords  "Greek  texts"  to  find 
sites  related  to  the  "Perseus  Proj- 
ect," turned  up  268,057  matches. 
"With  Alexa.  you're  bound  to  hit 
at  least  some  of  the  top  sites,"  Mr. 
Kahle  adds. 

Still,    one    needs    a    traditional 
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The  Shareware  Report 

Alexa  can 
find  sites 
you'd  like 

We've  heard  so  much  about . 
this  AJexa^tool  that  wc  had  to 
try  It 

AJexa,  according  to  the 
help  file.  IS  "a  free  Internet 
naviRaticn  service  that  helps 
you  surf  the  Web  smarter, 
faster  and  easier  TTie  service 
helps  you  determine  the  vulue 
of  web  sites  you  visit  through 
a  variety  of  statiRtics  and 
facta  about  each  mte  And  then 
recommends  links  of  whero 
you  might  wont  to  ]?o  next- 
It  certainly  does  that,  per- 
haps ad  nouseum.  The  tool- 
har,  which  Alexa  pinctxi  »l  the 
screen  a  bottom,  has  many 
choices  If  you  find  it  intruaive, 
mmimiic  it  on  the  taskbar,  or 
select  autohide.  It  suggests, 
according  to  your  current  Web  ' 
aitc,  where  yoft  might  ijo  next. 
It  even  lets  you  vote  on 
whether  you  like  a  Web  pa^, 
then  archives  it.  You  can  also 
add  links  to  the  hnt 

AJexa  provides  a  button  to 
the  EncydopccUfl.Dritannira 
3ite,"h«ndy  for  quick  reference, 
especially  wth  ila  search  win- 
dow If  you  nm  both  m^or 
browsers,  it  works  with  either, 
notjust  your  default, 

A  better  feature  is  the 
archival  referencp;  if  you 
receive  the  drended  "404  URL 
net  found"  response  in  your 
browser,  aak  Alexo  to  retrieve 
the  last  vuhd  page  for  th«t 
address  AJexa  i<<  one  of  the 
best  integrated  desktop/Inter- 
net  navigation  accessories 
we've  seen, 

Wheto  to  find  iti 
www  alexa  com 
Price:  Free 

Syetem  Requircmenta:  A 
PCrunninf  Window6  3.11or  ■ 
95  and  an  Internet 

Bill  Dubie  and  Dave  Sciuto  . 
co-host  "The  Computer 
Report  on  WCAPAM.980  in 
Lowell  on  Sundays  at  7  om. 
Check  out  their  Website  ot 
http //thereport.cora/ or  con-     . 


search  engine  or  a  specific  Web  ad- 
dress to  get  started,  Mr.  Kahle  ac- 
knowledges. And  Alexa,  unlike  the 
Alta  Vista  search  engine  and  oth- 
ers, can  suggest  linking  only  to  en- 
tire Web  sites,  not  to  specific  pages 
within  them. 

'a  supplement' 

"I  don't  think  of  it  in  the  same 
way  as  a  search  engine — it's  a  sup- 
plement," says  Bruce  Livett,  a 
reader  and  deputy  head  of  the  bio- 
chemistry and  molecular-biology 
department  at  the  University  of 
Melbourne,  in  Australia.  "Alexa 
gives  you  relevant  sites  in  the  gen- 
eral sense,  sites  that  you  some- 
times miss  because  other  search 
engines  depend  on  specific  key- 
words you  enter." 

Dr.  Livett,  who  has  been  using 
Alexa  since  October,  surfs  the 
Web  to  keep  up  with  the  research 
work  of  colleagues  around  the 
world.  "It's  competitive  work,  and 
I  need  to  know  what  they're  do- 
ing." Alexa,  he  says,  has  alerted 
him  to  research  sites  that  did  not 
turn  up  in  searches  using  Excite 
and  Anzwers,  a  search  engine  de- 
signed for  Web  users  in  Australia 
and  New  Zealand. 

Part  of  Alexa's  appeal,  he  says, 
is  access  to  the  Internet  Archive. 
When  Alexa  users  get  a  dreaded 
"40-1 — file  not  found"  error,  they 
can  click  on  a  button  on  Alexa's 
tool  bar  and  pull  up  the  missing 
page  from  the  archive.  Using  the 
archive,  Mr.  Livett  found  an  audio 
interview  he  needed  that  had  been 
removed  from  a  Web  site. 

Alexa  is  "an  immediate  use  for 
the  archive,"  says  Mr.  Kahle,  add- 
ing that  the  archive  is  the  compo- 
nent that  he  expects  will  eventually 
separate  his  search  engine  from  the 
rest  of  the  pack.  Alexa  also  offers  a 
direct  link  to  the  Encyclopaedia 
Britannica  Web  site,  allowing  us- 
ers to  retrieve  reference  informa- 
tion without  leaving  the  Web  page 
they  are  viewing  at  the  time. 


(QUESTIONS  OF  COPYRIGHT 

As  more  people  begin  to  use 
Alexa  and  the  archive,  however, 
tricky  questions  about  copyright 
and  privacy  have  begun  to  crop 
up.  An  e-mail  discussion  list  for 
Web  publishers  recently  included 
a  heated  debate  about  copyright 
issues  surrounding  old  newspaper 
articles  that  are  part  of  the  ar- 
chive. 

The  data-mining  computers  skip 
Web  pages  that  require  passwords, 
as  well  as  Web  sites  protected  by 
the  Standard  for  Robot  Exclusion, 
which  blocks  search  engines  from 
copying  pages  or  directories.  Still, 
some  Web  publishers  said  in  the  e- 
mail  discussion  that  Alexa  officials 
should  be  asking  on-line  newspa- 
pers and  journals  if  they  want  to  be 
part  of  the  archive,  instead  of  forc- 
ing them  to  block  Alexa  from  copy- 
ing pages. 

How  the  archive  will  be  used  in 
the  long  term  is  not  clear.  Mr. 
Kahle  often  mentions  the  early 
days  of  television,  when  programs 
were  broadcast  live  and  recording 
technology  was  primitive.  "When 
it  comes  to  a  point  where  users 
have  a  camcorder  recording  the 


"If  researchers 
use  a  traditional 


search  engine, 
they  may  miss 
some  of  the  best 
sites." 


Net,   then   the  archive   won't   be 
worth  it,"  he  says. 

In  Building  116,  the  archive  is 
stored  in  a  digital-tape  library  that 
looks  like  a  vending  machine.  The 
tapes  currently  have  the  capacity 
to  hold  20  terabytes  of  data  in  all, 
about  as  much  information  as  is  in 
the  Library  of  Congress.  So  much 
content  is  being  added  to  the  Inter- 
net that  the  archive  grows  by  about 
a  terabyte  of  data  each  month.  The 
data-mining  computers  are  able  to 
adjust  their  site  visits  to  concen- 
trate on  those  that  change  most  fre- 
quently. They  will  come  upon  a 
site,  however,  only  if  Alexa  users 
have  visited  it,  if  anyone  else  on 
the  Web  has  linked  to  it,  or  if  it  is 
listed  with  a  directory  service. 

14  REQUESTS  PER  SECOND 

While  the  archive  has  been  able 
to  keep  up  with  textual  information 
on  the  Internet— it  is  complete 
from  October  1996  to  the  present— 
the  effort  to  collect  images  is  run- 
ning a  few  months  behind. 

Through  Alexa,  the  archive  re- 
ceives about  14  requests  for  old 
pages  every  second.  Not  bad    its 


founder  says,  when  one  considers 
that  only  about  9,000  people  visit 
the  San  Francisco  Public  Library 
on  an  average  day.  Alexa  also 
helps  researchers  by  listing  facts 
about  the  site  they're  visiting:  the 
address  of  the  individual,  compa- 
ny, or  other  organization  that  owns 
the  server  on  which  the  site  is  lo- 
cated; how  many  people  have  visit- 
ed the  site;  how  frequently  the  site 
is  updated;  how  fast  its  computers 
are;  and  how  many  pages  the  site 
contains.  Alexa  also  allows  users 
to  vote  for  their  favorite  sites  and 
keeps  a  running  total  on  each  site. 
Mr.  Kahle  says  Alexa  does  not 
keep  individual  statistics  on  its  us- 
ers. Although  the  search  engine 
tracks  the  paths  of  users  as  they 
jump  from  site  to  site,  it  does  not 
record  users'  names.  "We  don't 
care  who  you  are,"  he  says.  "We 
just  care  what  path  you  take." 

Mr.  Kahle  dreams  that  Alexa 
could  become  as  popular — and  as 
profitable — as  search  engines  like 
Yahoo!  and  Excite.  And  the  Inter- 
net Archive,  he  says  with  enthusi- 
asm, could  become  part  of  a  large 
research  library,  although  he's  not 
sure  how.  "1  don't  think  about  the 
details,"  he  says.  "That's  why 
we're  doing  something  now  that 
others  thought  was  impossible,  or 
even  crazy."  ■ 


*  Links  to  the  Internet  Archive  site 
can  be  found  on  The  Chronicle' <, 
World-Wide  Web  site  at: 

http:llchronicle.coin 
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The  Digital  Attic: 

An  Arcliive  of  Everytliing 

Before  the  Internet,  you  lost  data  nearly  every 

time  you  upgraded  your  computer.  Now  you  couldn't  get 

rid  of  that  embarrassing  E-mail  if  you  tried. 
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FAST  FORWARD 


YOU  PROBABLY  HAVEN'T  SPENT  MUCH 
time  worrying  about  what  will  happen 
to  your  Web  site  when  you're  dead. 
That's  all  right.  David  Blatner  is  worry- 
ing for  you.  "I  keep  thinking,"  he  says,  "if  my 
grandparents  had  built  a  Web  site,  wouldn't  I 
svant  it  archived  and  available  on  the  Net  in  the 
years  to  come  for  their  grandchildren?"  So  he  is 
ready  to  help  with  his  new  Web-preservation  or- 
ganization in  Seattle:  Afterlife. 

Meanwhile,  in  central  Ohio,  a  site  called  Or- 
phans of  the  Net  is  "rescuing"  some  Web  pages 
that  have  been  abandoned  or  shut  down  —  gen- 
erally shrines  for  minor  celebrities.  If  you're 
looking  for  old  publicity  photos  of  Kimberly 
Williams  or  Renee  Zellweger,  rest  assured  that 
they  have  been  preserved  on  line. 

These  modest  salvage  jobs  notwithstanding, 
many  of  the  world's  librarians,  archivists  and  In- 
ternet experts  are  warning  that  the  record  of  our 
blooming  digital  culture  is  heading  for  oblivion, 
and  fast.  They  note  that  we  have  already  begun 
losing  scientific  data  and  busmess  records  — 
stored  on  ancient  punch  cards  or  written  in  dead 
computer  languages  or  encoded  on  decaying  Uni- 
vac  Type  II-A  magnetic  tape.  (Just  try  to  find  a 
Univac  tape  reader  when  you  need  one.) 

In  the  electronic  era,  we  are  stockpiling  our 
heritage  on  millions  of  floppy  disks,  hard  drives 
and  CD-ROM's.  These  flaky  objects  go  obsolete 
dismayingly  fast,  with  new  technologies  rolling 
in  on  product  cycles  as  short  as  two  to  five  years. 

"There  has  never  been  a  time  of  such  drastic 
and  irretrievable  information  loss,"  says  Stewart 
Brand,  creator  of  the  "Whole  Earth  Catalog"  a 
generation  ago  and  an  organizer  of  a  sobering 
conference  earlier  this  year  called  'Time  and 
Bits."  Our  collective  memory  is  already  begin- 
ning to  fade  away,  many  of  the  participants  be- 
lieve. Future  archeologists  will  find  our  pottery 
but  not  our  E-mail.  "We've  turned  into  a  total 
amnesiac,"  Brand  says.  "We  do  short-term  mem- 
ory, period." 

The  information-storage  medium  of  the  past 
couple  of  millenniums  —  for  words  not  writ  in 


stone,  anyway  —  has  of  course  been  paper  Paper 
does  decay  with  time,  and  it  is  fragile.  One  big  fire 
at  the  library  at  Alexandria  in  391  A.D.  destroyed 
a  calamitous  piece  of  the  ancient  world's  heritage. 
But  to  some  people,  paper  stans  to  look  good. 

"Paper  at  least  degrades  gracefully,"  says 
Brand  nostalgically.  "Digital  files  are  utterly  brit- 
tle; they're  complexly  immersed  in  a  temporary 
collusion  of  a  certain  version  of  a  certain  applica- 
tion running  on  a  certain  version  of  a  certain  op- 
erating system  in  a  certain  generation  of  a  certain 
box,  and  kept  on  a  certain  passing  medium  such 
as  S'/t-inch  floppy."  If  a  company  has  digital 
business  records  a  mere  decade  old,  what  are  the 
chances  that  it  has  also  stored  a  vintage  1988  per- 
sonal computer,  DOS  2.1,  and  the  correct  ver- 
sion of  Lotus  1-2-3? 

Some  companies  have  begun  "refreshing" 
their  aging  records  by  continually  copying  them 
onto  new  storage  media,  using  new  software. 
Refreshing  isn't  easy,  and  most  institutions  have 
not  yet  realized  that  it  may  be  necessary.  What- 
ever media  they  used  to  save  their  digital  infor- 
mation, they  will  not  be  able  to  read  it  without  a 
machine  —  a  finicky  antique,  most  likely.  With 
paper,  all  you  need  is  your  eyes. 

Perhaps  the  speed  and  richness  of  the  Internet 
have  lulled  us,  letting  children  in  Boise  read 
Census  data  in  Washington  and  oral  history  in 
Hiroshima.  Words  swim  instantly  across  the 
network,  not  caring  about  the  mileage,  and  we 
don't  exactly  feel  information-deprived.  But  are 
we  sacrificing  longevity  to  gain  glut? 

"Back  when  information  was  hard  to  copy, 
people  valued  the  copies  and  took  care  of  them," 
says  Danny  Hiliis,  co-founder  of  Thinking  Ma- 
chines Corporation  and  now  vice  president  of  re- 
search at  Disney.  'TSfow  copies  are  so  common  as 
to  be  considered  worthless,  and  very  little  atten- 
tion is  given  to  preserving  them." 

It's  scary.  And  yet  .  .  . 

Anyone  wandering  through  the  Internet 
might  begin  to  feel  that  memory  loss  isn't  the 
problem.  Archivists  are  everywhere,  in  fact  — 
official  and  self-made.  On  Sunday  July  3,  1994,  I 


played  a  hand  of  bridge  that  would  be  best  for- 
gotten —  but  no,  the  leading  on-line  bridge  serv- 
ice, OKBridge,  has  recorded  every  detail  of  the 
bidding  and  card  play  in  each  of  the  seven  million 
hands  played  since  the  beginning  of  that  year. 

Likewise,  any  silly  message  that  you  broadcast 
to  any  Usenet  newsgroup  is  now  being  stored, 
for  eternity  or  some  approximation  thereof,  by  a 
variety  of  commercial  services.  No  matter  that 
you  gave  your  last  posting  a  mere  five  seconds' 
thought;  you  should  be  prepared  to  hear  your  bi- 
ographer read  it  back  to  you  in  your  dotage. 

Most  people,  unfortunately,  don't  have  post- 
erity in  mind  when  they  fire  off  their  little  notes. 
Internet  communication  seems  so  spontaneous 
and  personal.  Will  people  really  want  future  em- 
ployers to  dig  up  ail  the  messages  they've  been 
posting  to  alt.dead.porn.stars  and  soc. sup- 
port.depression. manic?  Sometimes,  as  the  years 
go  by,  privacy  demands  a  gentle  forgetfulness. 

Many  people  sitting  at  company  workstations 
toss  off  E-mail  as  casually  as  they  speak  —  gos- 
sipy E-mail,  secretive  E-mail,  snide  E-mail,  raun- 
chy E-mail,  E-mail  meant  to  self-destruct  after 
serving  its  instant  purpose.  But  it  lives  on,  as 
corporate  lawyers  and  prosecutors  have  realized. 
Neither  sender  nor  recipient  can  delete  it  reli- 
ably. To  the  lawyers'  occasional  horror  —  here 
comes  the  subpoena!  —  it  lingers  on  disk  drives 
and  backup  tapes  Hke  a  late-night  guest  who  has 
forgotten  how  to  leave. 

The  biggest  proprietor  of  archivable  data  is  the 
Federal  Government,  struggling  to  preserve  the 
records  it  generates  daily  on  an  uncountable  scale. 
It  is  a  matter  of  current  litigarion  whether  every 
piece  of  governmental  E-mail  must  be  preserved 
as  a  "Federal  record."  Either  way,  the  task  of  the 
National  Archives  and  Records  Administration  is 


monumental.  "What  we're 
looking  at  is  growth  that 
there's  no  way  we  can  deal 
with,  using  any  known  tech- 
nique or  resources  we  can 
c^et,"  says  Ken  Thibodeau, 
director  of  the  Archives' 
electromc  records  programs. 
"Digital  information 

technology  is  creating  major 
and  serious  challenges  for 
how  we're  going  to  preserve 
anything  of  our  culture  and 
our  history,"  Thibodeau 
says.  "It's  also  creating  op- 
portumties:  we'll  be  able  to 
preserve  and  use  a  lot  more 
information  than  ever  be- 
fore." Pity  the  poor  histori- 
an, though.  The  Clinton 
White  House's  E-mail 
alone  figures  to  be  8  million 
fUes. 

Meanwhile,  in  its  unoffi- 
cial way  the  Internet  is 
transforming  the  way  infor- 
mation is  stored.  The  tradi- 
tional function  of  libraries, 
gathering  books  for  per- 
manent storage  or  one-at-a- 
time  lending,  has  been  thor- 
oughly confused.  Archiving 
of  the  on-line  worid  is  not 
centrahzed.  The  network 
distributes  memory.  There 
is  a  kind  of  self-repUcation 
at  work,  with  data  employ- 
ing humans  in  the  effort  to 
spread  and  reproduce. 

Web  site  by  Web  site,  the 
data  seem  as  frail  as  skywrit- 
ing —  smoke  in  the  breeze. 
Brewster  Kahle,  inventor  of 
some  of  the  best  Internet 
search  systems,  estimates 
the  average  lifetime  of  a 
Web  page  at  75  days.  He  has 
created  the  Internet  Ar- 
chive, though,  to  store  pe- 
riodic snapshots  of  almost 
the  entire  Wodd  Wide  Web. 
It  maintains  pages  lost  or 
shut  down  by  their  owners. 
It  amounts  to  about  eight 
terabytes  of  data.  ("Tera-" 
is  1,000,000,000,000.  Get 
used  to  it.) 

Brand  and  his  fellow  Cas- 
sandras  have  a  point,  and 
they  are  focusing  attention 
on  some  new,  practical  is- 
sues. Who,  if  anyone,  will 
decide  which  parts  of  our 
culture  are  worth  preservmg 


for  the  hypothetical  arche- 
ologists  of  the  future?  Can 
any  identification  scheme 
help  readers  distinguish  true 
copies  from  false  copies  in 
the  on-line  world's  hall  of 
mirrors?  What  arrays  of  op- 
tical or  magnetic  disks 
might  provide  reliabiUty  and 
redundancy  for  more  than  a 
few  years  of  storage?  Still, 
hope  comes  from  the  simple 
truth  that  the  essence  of  in- 
formation does  not  lie  in 
any  technology,  new  or  old. 
It's  just  bits,  after  all. 

In  the  world  before  cy- 
berspace, countless  bridge 
hands  were  played  and 
words  spoken  and  the  mem- 
ory vanished  like  vapor  into 
the  air.  Think  of  all  that 
data,  dissolving  no  sooner 
than  it  was  formed.  Once  in 
a  while  people  managed  to 
snatch  a  bit  back  from  the 
ether,  with  pen  on  paper  or, 
later,  audio-  and  videotape. 
They  succeeded  in  saving  a 


fair  portion  of  what  was 
worth  saving:  the  speeches 
of  Lincoln  (the  major  ones), 
the  poetry  of  Shakespeare 
(but  not  quite  reliably),  the 
plays  of  Sophocles  (except 
the  lost  ones)  and  a  few 
dozen  terabytes  more. 

Everything     is     different 
now.    The    Internet    turns 
much   of  humanity  into  a 
sort  of  giant  organism  —  an 
intermittently  connected  in- 
formation-gathering     crea- 
ture —  and  really,  amnesia 
doesn't  seem  to  be  its  fatal 
flaw.  This   new  being  just 
can't  throw  anything  away 
It  is  obsessive.  It  has  for- 
gotten that  some  baggage  is 
better  left  behind.  Homo  sa- 
piens has  become  a  pack  rat. 
Shed  tears  if  you  must  for 
the  backup  tapes  already  de- 
magnetized.    You'll     have 
many    opportunities.    Just 
last    October,    the    Daioh 
Temple  of  Rinzai  Zen  Bud- 
dhism   held    a    "memorial 
service      for     lost      infor- 
mation" in  Kyoto  and  on 
line.  Of  course,  the  details 
are   lovingly   preserved,    m 
English  and  Japanese,  at  its 
Web  site.  ■ 
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Brewster  KoMe  downfooded 
on  entire  library  for  you 


If  you  have  a  lot  of  time  on 
your  hands,  and  about  8 
terabytes  (8000  gigabyte*)  of 
hard  drive  space,  you  could 
download  the  entire  World 
Wide  Web  That's  equivalent 
to  ail  the  books  in  the  Harold 
Washington  library. 

You'd  probably  be  bored 
with  the  whole  idea  by  the 
time  you  finished— a^ssumuig 
you  were  still  alive  Fortunate- 
ly, Brewster  Kahlc  has  already 
done  the  job  for  you. 

Kahle  was  an  early  Iniemet 
developer,  creator  of  Wide 
Area  Information  Server 
(WAIS),  a  system  that  preced- 
ed HTML  (hypertext  markup 
language)  as  a  way  of  pub- 
lishing and  iiidexing  Internet 
information. 

From  a  project  to  preserve 
Web  pages  for  posterity— the 
Internet  Archive— Kahlc 's 
company,  Alexa  Internet, 
developed  a  new  kind  of  In- 
teract tool  that  may  not  change 
the  way  we  use  the  Web,  but 
will  at  Icaat  add  a  povtxrful 
new  dimension  to  it. 

Named  after  the  lost  library 
of  Alexandria.  Alexa  is  a 
browser  add-on  that~  runs 
separately,  and  consists  of 
several  components.  Installing 
Alexa  in  Windows  95  or  NT 
will  place  its  toolbar  in  the  tray 
at  the  bottom  of  your  screen 
when  Windows  is  booted. 

When  you  start  your 
browser  the  toolbar  show$  the 
site  name.  Click  it  to  get  a 
variety  of  interesting  informa- 
tion about  the  site,  including 
the  name  and  address  of  its 
registered  owwer ,  and  the  date 
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on  which  it  was  registered. 

You  can  also  "vote"  on 
whether  you  like  or  dislike  the 
site,  and  there's  a  note  of  how 
many  other  Alexa  users  voted 
one  way  or  the  other. 

This  i&  useful  if  you  have 
some  quesuon  about  the  site's 
integrity,  such  as  when  you 
wish  to  purchase  something 
from  it. 

Alexa  also  provides  ratings 
on  the  site's  popularity,  cita- 
tions of  reviews  if  available, 
its  typical  loading  speed,  a 
"freshness"  rating  (how  often 
it's  updated),  and  how  many 
links  and  pages  it  has. 

Alexa  also  tries  to  recom- 
mend other  sites  similar  to  the 
one  you're  viewing,  by  pro- 
viding links  to  Sites  that  were 
followed  by  other  recent 
visitors. 

Because  Alexa  Internet  has 
downloaded  (and  occasional- 
ly updates  its  "snapshots"  oO 
the  entire  Web,  its  archive 
icon  can  bring  up  the  most 
recently  archived  version  of 
many  sites  that  have  moved, 
or  no  longer  exist. 

This  can  relieve  the  frequent 
frustration  of  encountering  the 
dreaded  "404"  (page  not 
found)  error  message 

There  is  a  handy  link  to  the 
Encyclopedia  Brittanica,  and 
the  Merriam-Webfiter  onii 


dictionary  and  thesaurus. 

Supported  by  small,  oon 
intrusive  ads  thai  appear  on  it 
toolbar,  Alexa's  price  is  right 
free  It's  a  fast  1.2mb  down 
load  at  www.alexa.com. 

You  can  te&t  its  informatio 
service  without  downloadin 
it:  set  your  browser  to  http:/ 
widener  alexa .  com/sitcdaw 
whitehouse.gov  for  a  sampl 
(substimte  any  site  you  wis 
instead  of  whitehouse.gov). 

With  Windows  95  be  sui 
you  have  a  version  of  the  fil 
KeTneJ32.dll  dated  later  tha 
Feb  2,  1996.  Use  the  Fio 
command  on  your  Start  men 
to  locate  it  and  check  its  dai< 
An  updated  version  can  i 
downloaded  from  Alexa's  siti 

Alexa  works  with  Netscaf 
or  Internet  Explorer  versior 
3  or  later.  It  also  works  we 
with  my  new  fovorite  browse 
Opera  (NNN  Feb.  28). 

Serious  surfers  won 
browse  without  it. 
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Alexa  is  your  window  to  the  whole  wide  web 

Get  maximum  mileage 
out  of  searcli  engines 


By  Cart  Lum 

Contributing  writer 


You've  finally  set  up  your  computer 
to  access  the  Internet,  but  how  do  you  find 
anything  in  this  vast  virtual  "library?" 
Search  engines  are  the  logical  places  to 
start.  Yahoo!  (www.yahoo.coni)  is  a  house- 
hold name  in  Internet  circles,  as  are  Excite 
(www.excite.com)  and  Alta  Vista  (www.al- 
tavisu.  com). 

Sometimes  these  engines  are  a  little  too 
thorough.  A  typical  search  may  result  in 
thousands  of  hits.  It  takes  a  determined  ef- 
fort to  separate  the  wheat  from  the  chaff. 
Browsing  through  categories  offered  by 
many  search  services  may  help,  but  it's  te- 
dious and  only  helpful  if  you're  looking  for 
something  general. 

This  IS  where  Alexa  comes  in.  In  an 
industry  where  "cool"  and  "unique"  are 
twin  Holy  Grails,  Alexa  is  an  ambitious 
undertaking  that's  free  and  designed  to  put 
human  intelligence  into  Web  navigation 
Like  any  good  guide,  it  eagerly  informs  the 
user  about  the  site  visited  and  quickly  rat- 
tles off  a  list  of  similar  sites. 

How  does  Alexa  do  this?  This  is  where 
the  ambitious  part  comes  in:  Alexa  takes  a 
"snapshot"  of  the  entire  Web  every  30  to  60 
days,  recording  the  entire  World  Wide 
Web.  It  takes  about  two  TB  (terabytes)  of 
disk  space  to  store  the  Web.  Alexa  has  about 
eight  terabytes  in  its  archives.  For  compari- 
son, a  typical  public  library  contains  about 
three  TB  of  information,  a  video  rental 
store  holds  about  eight  TB,  while  the  entire 
computer  generated  Toy  Story  movie  takes 
up  half  a  terabyte.  (A  terabyte  is  1,000  giga- 
bytes, or  1 ,000,000  megabytes.) 

In  addition  to  this  huge  archive,  Alexa 
uses  anonymous  "user  paths."  Every  time 
an  Alexa  client  is  used,  it  records  where  the 
user  went,  how  long  was  spent  at  each  sire, 


and  if  the  user  gave  the  site  a  thumbs  up  or 
thumbs  down.  It  also  notes  how  quickly  the 
site  loads,  how  often  it's  updated,  and  who 
owns  the  site. 

Alexa  uses  this  information  and  other 
sources  to  generate  its  Hst  of  "Where  to  Go 
Next"  sites.  And  unlike  the  thousands  of 
hits  returned  by  a  search  engine,  AJexa's  list 
IS  usually  no  more  than  a  dozen  highly  rele- 
vant, items. 

To  use  Alexa,  download  and  install  the 
client  software  from  www.alexa.com.  Dur- 
ing installation,  provide  general  demo- 
graphic information  about  yourself — Alexa 
supports  itself  solely  from  advertising  rev- 
enue. The  option  "Rather  not  say"  is 
thoughtfully  provided  if  you  prefer  not  to. 

UTien  you  begin  surfing,  Alexa  springs 
into  action,  displaying  useful  information  in 
a  toolbar  (or  "dashboard")  below  your 
browser: 

*  Where  You  Are  displays  the  web  site'.s 
popularity,  who  owns  the  site,  and  other 
nifty  information; 

*  Where  to  Go  Next  recommends  simi- 
lar sites  to  look  at; 

*  Archive  of  the  Web  retrieves  missing 
web  pages. 

*  Desktop  Reference  provides  instant 
access  to  the  Encyclopaedia  Britannica  and  the 
Meniam-  Webster  Dictionary  and  Thtsaurus. 

I  found  Alexa  responsive  and  solid  for  an 
early  version  software.  Its  site  recommen- 
dations were  interesting:  While  at  Mi- 
crosoft's site,  Alexa  suggested  that  I  visit 
Apple,  Sun,  and  Netscape — three  of  the 
most  rabid  anti-Microsoft  companies  out 
there.  However,  there  was  no  mention  of 
visiting  Microsoft  when  I  went  to  sites  for 
Apple  and  Netscape. 

Alexa  requires  a  486/33  running  Win- 
dows 95/NT.  A  Macintosh  version  is  in  de- 
velopment. 

carllum@uniserve.com 
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New  producU  help  coordirs^te 
several  search  engines  and 
yield  more  eUicienl  resufe. 

|)y  Ioiin  IIokkanlm 


w)ni  nil'  liMANOiNi.  liUi'iii.'i  ukI  ilic 
mvrnasc'J  usii  "I  pli^clronlc  data  slornK'' 
\\\  WW  iui..l.>...  Unv  otlu-fi.  vho  goal  of 
m»n;i>(iiii;  iIh-  knowli^dR*^  i.>li-v,iiit  to  lnw 
l>r:n  lin  lijs  bcKriur  iiii.TcaMn;;ly  clifll- 
(III!  I  urtvinultily.  Ill  ll>''  liilmici  ond  In- 
iraiicts  liiivo  boroDi.'  r>'»tforiiis  (or  publl- 
I  ;iiii)ii  and  lollftboi  A(ivo  sysK.-m  dcvolop- 
hiom,  in.iiiy  nww  scmrli  iudIs  Imve  ap- 
l»i-.<riU, 

I'asl  discusslnni  of  scartli  leclinolo- 
gics  hiivn  (nrutod  on  w)iii.h  major  Rearrh 
engine  was  the  hf^l  or  mosl  compreh«n- 
siv;.   Now   nf>v.   sinjil   dfisktop   soortli 
looli  ic^V  l<(  kvtriiRo  the  big  scurchiin- 
(iins    r<»po5l(oric!,    and    tackle    different      p 
kinds  iif  problem'!   Tlio  new  searrh  wld 
[!«ls  sock  tti  muku  senrrhing  niultipio  In-     ^' 
iprnct/liUraiiBi   search   ontfiima    a   Ibs*;    ; 
pAinful  Is'ik.  Scarclling   numnroti*  en-     ! 
Ijines  vvlth  different  interfnccs  can  be 
llmccOiiSuminB  Thp  lUi?  rosulting  fVoni 
iVif  various  snarrhos  frequently  Include    ^ 
many  of  the  same  documenw.  and  ufc^rrs 
must  Identify  and  ican  duplicate  hits  In    ^ 
order  lo  move  on  to  unlngpected  dor.u-     , 
m(>ilts  In  addition,  diere  is  often  no  way    , 
tn  <!U)r(;  one's  search  qxitry.  y 

An  additional  problem  relalos  lo  a  . 
Croup's  ability  to  cream  a  eustomized  ;, 
knowledee  base  nr  data  collection  that  . 
may  be  shared  and  easily  searched  Pull*  |. 
text  retrieval  may  be  o  great  way  to  mov<>  . 
around  in  undliTorentisted  data,  but  o  ,,• 
group  vvlth  a  particular  domain  exper-  t 
tise.  Stich  as  envirnnmontAl  Iqvw.  might  r 
wish  to  collect  the  variou<t  lnt«ni«t  and  i 
intranet  rosourcfs  and  apply  gtructuro  to  ,, 
such  comeiil  A  rollabortive  method  of  nj 
creating,  maintaining  and  «ntf  turing  a  ;o 
knovvledge  blue  with  a  nrlcndly  folder  j. 
would  h«  useful  Such  knowledge  baso% 
could  also  be  cloned  and  placed  on  «  se-  c. 
euro  extranet  Ibr  a  client,  and  this  addl-  ig 
tional  service  cnuhl  be  offernd  At  no  cost    „. 

Having  collectively  populated  a  jr 
knowledge  base  with  materials,  a  group  s- 
mlght  alio  bencHt  fVoni  having  users  pro-  ijt 
vide  additional  feedback  For  oxftmple.  a  ,| 
user  might  wish  to  provide  ratings  of  as 
particular  documanu  to  reflect  their 
overall  value  or  annotatlonK  tu  identify  .is 
specific  Items  of  mterest.  n- 

Th  the  extent  that  usage  of  tho  collec-  ro 
tlon  Is  monllnrod.  the  3yvt/^m  might  akn  p- 
assesk  which  documanu  may  be  of  more  lio 
or  less  value  to  Ubers  Inoking  for  parilcu-  ck 
lar  items,  and  the  ovaluatlnnj  or  rank-  ip- 
Ings  can  provide  a  foundation  for  group  mt 
fillering  algorithms.  All  of  these  ideas  ul. 
have  already  been  impi«m«nted  to  vary- 
ing degrees  In  existing  Internet  technnJo-  ice 
gies  Reviewing  snmo  of  these  products  >nd 
may  provide  one  vrtth  ideas  about  which  os- 
fealures  ronld  be  usehil  for  on  otBco'f  iko 
next  legal  pracUCO  system  crs 

oc- 
Applloationa  That  Interact  nts 

In  creating  a  practice  iiystcm,  a  law 
olBce  may  wish  to  cumbinn  some  of  thes^or 
technologies,    using   both    off-tho-shelf'he 
ftnd  custom  pieces.  One  common  r«a(ureiCW 
of  these  new  search  tools  Is  the  use  of  a"!'' 
separate  window  or  application  to  creote  '^'"■ 
a  persistant  search  Interface.  The  appll-  «= 
cations  come  in  two  llavors.  more  or  less 
The  first  IS  a  separate  Windows  applica- 
tion  that   can   interact   with   the   Web 
browser  The  second  variety  Implements 
a  solution  completely  within  the  Wub 
browser  witli  a  separate  window. 

Examples  of  the  windows  applica- 
tions category  are  Quarterdeck's  Web 
Compass.!  vvcb  Ferret.'  SSSpider*  ami 
^6X8.»  These  appllcaUons  interact  with 
the   m«yor   search   engines   In    various 

Mr  Hokknitfn  is  a  law  nrnrlirm  ' 


wny;  ,tiid  unrli  nders  mluro';liiig  cdpa- 
biliiiHs  Tor  example.  WBbComi>«s»  Wiib 
reriM  «..d  SSSplder  will  M«rth  multlpln 
search  tfiifilnes  and  combine  the  result- 
iiit  sets  lnlf>  rt  •iiiigic  bui  with  duulkwlts 
ruinovwd  Web  Lompnss  allows  liio  wief 
tr)  ,idil  .uUlilionnI  s^Arcli  Cligiiuis.  sui  h  as 
"Ur.iiici  und  srlicdulo  autoniailr  c'lctkb 
for  now  liifor,.,.v(,on 

Alexa  l^  an  iiilBreiiinf!  |>n.ic  of  plane- 
VMy  Kroupwaie  It  lots  tlio  iisor  run  llii: 
itppliratloo  aiiO  use  tlip  Web  browser  It 
li  if>l  a  tool  foi'  nfllvi:ly  simichlng  m<\jor 
•"ii^iuis.  mili'H<l,  ,is  one  browips  (Ik: 
(iiOKrain  comnuiiiir.xies  with  i\\»  Alc«a 
■»•  rvcr  The  Ali^xn  toolbar  provldi-s  rtill.r- 
cut  s(ali5tic.<)  fthnui  llic  Web  lUf  dml 
<;ln)wv.  rcluled  sites  Alyxu  usnri  cm  pviu 
rrvu'w  a  site  by  givim;  it  a  "tliumbt  op  ' 
I  Ids  piissive  relaiiiig  of  one  site  to  anoth- 
er opens  up  completely  different  avenues 
ond  allows  ih».  ^carvhor  to  peiform  col- 
loboratlve  filtoring  of  sires 

McltinK  Point  Is  groupware  of  a  diffor- 
f^ni  sort.  alluwinK  an  orficH  to  r.olleri  rcla- 


li(in<.liip  Ixfoi  nialJoii  nboiit  doriiiiicnls  in 
PCDOCS.  Ill  MSHxrhangC  or  on  an  in 
traiicl.  The  Inrnrmatioii  is  stored  In  « 
server  data  LaA«,  allowInK  colliJi;''**'  '" 
?;hare  and  scurth  the  rplaUonsliip  inlor- 
m^tioii  uutliored  by  otlicri. 

S'r>mfl  npw  search  enpini^s  ah  provid 
ing  the  f  ,\pabilme";  of  Ihosr  .ipplirnlinns 
For  example.  No,  ll.cr.i  Light  will  prnvidi- 
a  folder  bai^d  view  of  liie  results  of  u 
search  Meanwhile.  liifcrHiun  fhid'' 
searches  six  of  tl>c  ttiujor  searrh  engiiiBS, 
rt'mnves  tluplii  (lies  and  |>ri)viUe.>;  (i  rolf>- 
vunryianked  li.sl 


Browk/aer-Based  Toola 

A  Navigator/Explorer  plw 
to  lh«  Windows  appHr.atloiis 
bu Z'  whirli  li  dlbUlbuIf'd  nl  r 
tool  li  slinil,«r  to  the  Windo 


II  similar 
WcbTUr- 
cost  Thh 
,  applica- 
tions In  ilial  it  searches  iii.yor  search  en- 
gines and  removes  duplicates  from  tlm 
resulting  list;  documents  on  the  list  may 
be  previewed  The  applie/ition  Is  frCc  be- 
rai«i<>  the  company  sells  advertising  jji 


ll.'-  1,,j,l,c„|,oM  wndinv, 

Hiowser  b.iNcd  Kioli  do  not  stop  wlili 
ci'nir«ll7ed  Servdf  rpioiirteo    I  or  f<M't 

pie   .11  IUM\  W<-h  <it«  '  whf  II roni  a 

keyword  seaifli.  a  sfpnrale  iear(  h  if  nl 
window  IS  spun  oil  lo  liclp  the  us<>i  n.ivi- 
gatc  the  sue  Th.s  is  smiilar  in  ,i  Irei^  Imil 
known  n>!  the  Search  "^(MtUilv:,'  h  \n\\- 
srrlpt  hrowsrr  tool  ilr.ii  ^jivr'.  ih'-  u-.ir  a 
fotisisleiil  lnlfrf,i/.e  li>r  io.?//.)iing  ihf 
iimior  ■search  (ingine<i  AJleriiniivelv  Uoii 
Ts  sc.irih  toof  iirondni  a  ■.eparulu  win- 
dow, though  this  (mil  <!(,.  s  i.m  intogifii.- 
the  vurlous  si'ircli  iinginc-.  uUo  ii  singk 
'i)ti;rface  as  dd.v  th.,  Scnr.  h  s.m  llii» 

Ihe  InfoMiiider."'  a  lav.ijenpi/Jnva- 
loised  tool  ilmiUr  to  the  .Sftfiirh  iiitclliie 
uUeriiel';  with  .->  ctnirnli/c^d  «( rver  dau 
bsse.  In  ftddlliim  lo  being  nhlo  to  (jirgo 
ri/e  bcorrh  engints  m  u  Inldf^i  metaphor 
and  then  •tohiiiil  queries  as  th*;  Soarr.li 
Satellite  do«s  lnfoMiild«r  ran  also  slort- 
Ulil.s  and  full-lfixl  index  tlic^  dnnimenl 
for  subsequent  srarclii's  With  ihii  tool  a 
ISKK   winci'is   FAt.r  ni2l 


FROM  :  ANTENNP  GROUP 


PHONE  NO.  -.415  896  1094 


Pipr.  28  1998  10:48W  P. 


,:(l«<. 


iv,:|y; 


lS-,..M.t)l(.  .lklW>^vl■ 

ti/Si-inaiitu    iluiu- 

Iticiils  A";  ivilh  Ali:x,i,  iiScTh  <:iiii  piovldi; 
ranklnj;s  (iT  dociinioiUs.  iiml  ihi-;  liifoi- 
mftlion  Is  iniJndcd  on  Iho  lisCi  of  roMiIlt; 

At  till-  same  liino.  (lcv(ili)pin(nl<;  li.ivo 
iilso  ucrurnrd  in  Itio  orca  orrolliibornlivo 
(illprini'  "  I  his  tcctinoloyy  ilomoii>ilr.'\tos 
h(tw  ft  vnry  sin/iU  sol  of  input  dnt;i. 
pl.iccd  III  thn  roirlcxt  of  a  Inrgi-  niiinlKir 
of  sinill,i.r  users  cnn  jjcncinlc  some  pow- 
erful tollfiborollvc  liHoi  iny  rnsiilts 

Onn  fail  tro  this  tccltnolO((y  in  nction 
.11  th(»  MovloCrilir"  silo  wdcro.  nflcr  il<c 
user  raips  u  do>cn  movicB.  the  nnRiiio 
providn";  rocommnnHniions  based  upon 
llic  1  alings  of  pcopln  wlio  like<l  und  dis- 
likod  ihc  ?onio  movies  Bccimsc  this  ap- 
))ro;irl\  can  provide  meanmRail  rosull 
■:Cli!  vvitlj  u  •iiiiall  input  SCt.  it  5llOw>.  th.ll 
Imiiinn  prefcrcnr.ps  can  clustfir  .uid  \UM 
tliisicrinp  can  subSiiqnonOy  he  usori  Tor 
tlioso  Willi  doubts  nbout  wluro  tcrlinolo- 

Py  is  llCnJ(!d.  tl>l<l  "lili;  piOvido-s  ah  ilUCr- 

Oiling  vision  of  the  opiit-r  limits  of  inyro 
or  Inss  pfi3Sivo.  cuslomcr-drlvrn  tooh. 

CoUBborulivp  filu-ririg  mcchnniims 
rnay  play  un  lii^pononl  rote  in  lover rjglnp 
Uwycr  work  proiluct  ns  now  associates 
look  for  good  work  prorttict  croxlcd  hy 
oilKjrs  Thern  ii  somo  risk  thM  Iho  guld- 
ftiicc  Hint  a  cotliibor.ilivo  liltorlng  syntom 
provides  might  go  beyond  passive  sug- 
gestion and  herd  n  new  user  lownrd  dot,- 
umeiiti  th.^vi  ore  froqncntly  used,  oviir- 
lookiiij;  «  more  obscuro  bul  more 
penclratlni,'  document  f!ut  full-loicl 
suarching  of  lh«  rolleclion  by  the  iisnr 
may  provide  a  ch«>ck  on  such  nuiommion 

rinnlly,  unc  Eito  offers  n  vviiy  to  cronic 
a  fcfcdback  mcchnnlsm  to  o'no's  e-mail 
account  Thn  lilfarmani"  allows  users  to 
"specify  Web  sites,  and  (he  t-nrvcr  then  c- 
mails  users  with  updated  infontiQilnn. 

The  convergcnCB  of  sonrcli  engine, 
grotipwarc  and  collnhorative  .fillcrliig 
technolo(;ios  witltin  lliu  Wob  oiwiron- 
mtini  domnrtslratcs  tlir  possibilities  for 
dovtiopinoni  of  now  Icgnl  prnclico  Sys- 
ten»t>  roll-lext.  rolevrtncy-rankcd  icsull 
lists  may  be  complcmBniod  by  nuthored. 
hlcrarrhical.  coiiloxt-bnsf-d  merhaiiisms 
ol  calnRorizing  Informntiun 

ny  duc.enlrahrln^the  julliorlngoflliis 
roiitcxi  dniii.  the  npijlicatlnns  boroiim  In- 
tfirrtciivo  ({roiipwnro.  nllowinK  «n  onlirc 
pvoup  10  Khiire  bulh  the  work  of  develop- 
tug  Such  a  rnKqnrce  nnd  its  beiiefilt.  The 
ability  to  (fpate  syslcini  with  feedb.itk 
mr'rhaiii<;ins  furthers  ihi'!  groupwaro  np- 
pro.irh,  and  hclpi  create  markers  thai 
indlcalo  wliirh  documents  are  usofol. 
current  or  of  little  eonscquonre 

Collaborative  filtering  lochnoloKics 
that  monitor  the  use  of  the  systcin  and 
provide  lis  visere  with  pnisivo  sujiijcs- 
linns  ubniit  other  doeumrnts  can  inakn 
systems  even  morn  usablo  »o  lawyers 
with  liinticd  compulni  slfilli.  In  the  proc- 
ess users  mfiy  bo  directed  to  dt>rumenlE 
lltftt  they  may  tint  havo  known  oxist 

VVhon  applications  or  their  borver- 
basfid  coinpnncnls  arc  mail-awnre.  the 
systems  might  notify  tisors  when  new 
tii.itcilalb  arc  added  nnd  which  wottld 
tpioUfy  for  a  user's  search  or  be  of  inter- 
est when  a  usor  Is  prriisini;  Ihe  collec- 
tion 

ID  |l|tlf>'/yiVi  itcltnltl  fl*lucV  cr^\/i|Jncti/ 
pioJurlWc2<V) 
(2)  (iv>w.  ferrnr^un  rnn.'uolf>rrd/| 
t.t)  (vu\vwbrytlc(.li  ''C'n/Jir^nlAf.O  l>tm| 


(S)  lw^v 

tOl  (v/ww  wchliirbo  com/) 

(7)  tw%w,ltin  com) 

tm  limp  /Avvv.vl  p»ir,com/)piriio>Arnrrh/). 

10)  (www  ^corlll.,fl  iiim-Ml|vc>nVi>llry/I)iiy/ 
8010/) 

|ll))(.v>v<vlpir.  ...I") 
'    |11)(ivwwtllrO'>ilndcc"<iil 

(12)  (w**  muvlf «  .(Ic  n.ml 
'   (tj)(lill|>//1nf<>riii.Anl  ilMtinniilli  i-fli./)  OB 
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spaces 


7m 
>  fiiuUaiJil 
fearless" 


L 


By  Tracy  Levine 


Icxa  Internet  is  a  unique  firm  that  has  created  a  very 
important  tool  you  can  download  today-for  free.  The  com- 
pany has  its  offices  within  the  Presidio,  in  a  historic  San 
Francisco  building  dating  baclc  lo  circa  1885.  The  exterior 
is  unassuming,  but  inside-the  excitement  and  energy  levels 
are  high. 

Led  by  Brewster  Kahie,  the  corporate  culture  manifests 
innovation,  ideals,  vision  and  an  ethical  consciousness.  Like 
the  executives  of  many  companies  who  have  chosen  the 
Presidio  as  their  headquarters,  the  AJexa  management  team 
is  not  purely  profit-driven.  Rather,  they  are  progressive 
thinkers  who  want  to  be  a  part  of  a  better  tomorrow. 

"Information  should  be  shared-not  the  plaything  of 
billionaires  or  lost  completely."  explained  Alexa  president 
Brewster  Kahle.  "We  are  creating  open,  civic  spaces  and  fos- 
tering the  sharing  and  archiving  of  knowledge." 

Kahle  claims  that  Alexa  Internet  is  the  first  company  to 
offer  Web  navigation  that  actually  learns  from  its  users. 
{More  about  this  point  later!)  Founded  in  April  1996  by 
Brewster  Kahle  and  Bruce  Gilliat.  the  company  provides  a 
solution  to  a  major  problem  of  the  Internet:  the  inability  to 
find  information  quickly  and  easily. 

Alexa  is  a  free  service  that  helps  you  surf  the  Web  faster 
and  much  more  efficiently.  It  works  in  conjunction  with  Web 
browsers  and  is  displayed  as  a  toolbar  on  your  desktop. 
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As  the  Interne!  has  grown  to  overwhelming  and  unmanage- 
able proportions,  navigating  has  become  a  total  nightmare. 
Since  search  engines— even  the  good  ones— rely  on  keywords 
to  locate  information,  you  can  be  thrown,  unwittingly  (and 
unwillingly!)  into  a  melting  pot  of  information  that  is 
completely  irrelevant  to  your  search.  Wading  in  this  quagmire 
of  meaningless  data  is  time<onsuming  and  frustratmg.  Alexa 
cuts  through  all  that  and  effectively  helps  you  target  your 
search  and  even  suggests  links  related  to  your  personal  quest. 

"We  operate  purely  from  the  user's  perspective  and  are 
focused  on  helping  people  make  sense  of  the  Web.  We  think 
the  Web  is  worth  it."  said  Kahle. 

Unfortunately,  as  the  Web  has  become  more  convoluted, 
the  game  has  become  more  complex  while  the  stakes  (and 
potential  profits)  have  increased  as  well  Cynthia  Lohr,  pub- 
lic relations  manager  for  Alexa.  insisted;  "Search  engines  are 
evolving  into  families  of  online  media  properties.  Their 
primary  purpose  is  no  longer  to  help  you  find  information. 
They've  become  malls— trying  to  keep  you  in  their  stores  as 
long  as  possible." 


IIIbhb  Is  IIbI0 


Alexa  is  certainly  very  helpful.  I  used  it  to  research  some 
stories  and  found  the  time-saving  aspect  to  be  most  valuable 
The  key  features  of  Alexa  are: 

•  UlllErB  1)011  ar8-givcs  you  worthwhile  information,  such  as 
who  the  site  is  registered  to,  how  frequently  it  is  updated, 
plus  ratings  and  reviews  from  publishers  and  other 
third-party  sources. 

•  niters  to  P  neHt-delivers  suggested  sites  to  further  flesh 
out  your  search. 

•  flBShtl^  reftrEnCB  SBrviCBS-imegrates.  into  your  browser, 
Britannica  Online,  including  the  new  Britannica  Internet 
Guide,  as  well  as  the  Merriam-Websier  Online  dictionary 
and  thesaurus. 

•  BrdlillB  Uf  tilfi  UIBb-lets  you  access  Alexa's  extensive  archive 
of  the  Web  to  reach  previously  (annoyingly)  unavailable 
'"404-Not  found"  pages. 

This  last  feature  is  most  intriguing.  I,  for  one.  did  not  real- 
ize that  information  is  simply  vanishing  all  the  time.  It's  not 
something  we  really  spend  a  lot  of  time  thinking  about.  Isn't 
someone  in  charge  of  keeping  all  that  information?  Well.,  .no. 
This  fact  was  an  impetus  for  the  founding  of  Alexa. 
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Brewster  Kahle  was  a  key  pioneer  in  the  Internet  publishing 
industry.  In  1989.  he  invented  Wide  Area  Information 
Servers  (WAIS).  an  early  Internet  publishing  system  that 
worked  with  Dow  Jones,  the  New  York  Times  and 
Encychpedia  Britannica.  He  sold  WAIS  to  America  Online  in 
1995  for  $15  million. 

How  did  Kahle  segue  into  his  current  enterprise?  "It  was 
exciting  to  play  a  role  in  the  Internet  publishing  movement. 
Building  an  information  community  seemed  to  be  the  next 
logical  step.  Were  building  something  really  neat  here  that 
will,  we  hope,  make  the  world  a  little  better  for  it,"  he  said. 

Kahle  and  Gilliat  have  embarked  on  a  very  ambitious 
mission.  They  are  archiving  the  Web  (yes.  the  whole  Web)  in 
a  database  for. essentially,  mankind.  Alexa  began  "collect- 
ing" the  Web  in  early  1998.  and  it  now  has  a  minimum  of  four 
snapshots  of  over  800,000  Web  sites. 

So  how  vast  is  AJexa's  archive?  It  has  already  surpassed 
Alexandria  in  size  and  is  now  equal  to  the  Harvard  University 
archives  Company  ofTicials  say  they  expect  it  lo  pass  even 
the  Library  of  Congress  in  the  coming  months. 

Well,  its  certainly  comforting  to  know  that  someone  has 
taken  a  leadership  role  in  protecting  and  preserving  all  that 
information-not  only  for  us  but  also  for  our  future  genera- 
tions. Yet,  Alexa  wants  to  be  more  than  just  a  massive  library. 
Its  founders  have  raised  the  bar  to  an  intriguing  level. 
Information  is  being  shared.  Information  is  being  saved.  And. 
it's  getting  richer  as  more  users  work  with  it. 
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Alexa  actually  learns  (through  an  intricate  form  of  data 
analysis  and  capturing  usage  paths)  from  the  people  who  use 
it.  So.  it  follows  that  as  the  number  of  users  increases,  so  does 
the  value  of  the  information.  Kind  of  like  a  good  chain  letter! 
Cynthia  Lohr  described  it  this  way;  "When  you  walk  on  a 
path  through  the  woods,  you  arc  benefiting  from  the  explo- 
rations of  those  who  went  before  you— finding  the  best  way  up 
the  mountain,  or  down  to  the  sea.  Alexa  seeks  to  do  the  same 
for  the  Web.  You  gain  the  benefit  of  everyone's  experience." 
For  example,  if  you  were  looking  for  background  on  Hong 
Kong,  your  initial  personal  search  might  lead  you  from  the 
official  tourist  site  to  the  discovery  of  sites  for  the  daily 
English-language  newspapers  published  in  that  Chinese  city. 
The  Alexa  software  would  note  the  steps  in  your  search  and 
suggest  those  newspaper  sites  to  later  Alexa  users  Inquiring 
about  Hong  Kong. 
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Alexa  does  profess  to  have  a  cause,  rather  than  just  a  sales 
goal.  But.  then  again,  even  those  bound  by  altruism  have  to 
eat!  There  have  been  300.000  downloads  of  Alexa  so  far- 
all  free  of  charge.  The  question  is.  how  arc  its  owners  making 
a  profit? 

The  company  was  seeded  with  some  of  the  gains  from  the 
founders'  WAIS,  Inc.  transaction.  It  continues  to  grow 
through  its  unique  advertising  strategy.  Alexa  offers  advertis- 
ers  the  singular  ability  to  advertise  in  a  very  targeted  way.  The 
message  is  accessed  through  the  Alexa  toolbar  displayed 
on  the  user's  screen.  Context-oriented  advertising  allows 
advertisers  to  target  specific  Web  sites,  geographic  regions 
and  even  the  time  of  day.  A  golf  club  vendor,  for  example, 
would  benefit  from  advertising  on  Web  pages  that 
reach  golfers. 

Alexa  has  50  advertisers  so  far.  and  the  mariceting  depart- 
ment says  they've  generated  a  lot  of  interest  from  additional 
organizations  that  want  to  do  a  better  job  of  targeting  their 
Web  advertising. 

The  company  has  some  impressive  goals,  and  because  they 
are  m  a  ground-breaking  mode,  they  have  been  able  to  attract 
a  "great  group  of  people"  (27  on  the  team  and  growing). 
Success  and  creative  satisfaction  are  what  the  Alexa  folks  are 
shooting  for. 

But,  what  does  Brewster  Kahle  predict  for  the  future  of  the 
Interoet? 

"Video!  It's  here  now,  but  soon  it  will  be  much  more  preva- 
lent. It  will  change  everything.  Just  think  of  how  you'll  be  able 
to  cut  down  on  business  travel.  Rather  than  attending  a  trade 
show,  you'll  be  able  to  preview  products  through  online 
video.  The  world  of  education  will  be  greatly  enhanced  too 
and  add  a  whole  new  spectrum  of  possibilities!" 

Kahle  has  a  personal  technological  goal  too.  He  is  trying 
to  set  up  an  unprecedented  "sonic  speed"  networking  system 
that  will  unite  all  companies  and  residents  in  the  Presidio. 
After  experiencing  this  executive's  drive  and  enthusiasm,  I 
expect  this  concept  will  soon  be  a  reality. 

To  learn  more  about  Alexa  or  to  test-drive  it  yourself,  you'U 
find  the  company  at  UflinB.aleiQ.IQSL  For  information  about  the 
firm  or  its  advertising  opportunities,  you  can  call  Cynthia 

Lohr  at  11.151  SB1-B786. 

CodteNt'Oriented  advertisiflp  alloiiis 
advertisers  to  target  specific  Uleli 
sites,  aeooraDhicreflioos  and  even 
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Internet  just  keeps  on  growing, 
some  cool  tools  help  you  surf 


By  DENNIS  SEEDS 

Editonal  Systems  Manager 


A  new  n-eeware  program  called 
Al£2t3  IS  making  news  in  the  Inter- 
net worJd  —  a  world  that  counts  56 
million  Americans  as  users.  Alexa 
is  an  add-on  to  your  Internet 
browser  that  gives  you  a  rating  of 
the  Web  site  you  are  viewing,  and 
can  taJte  you  to  lost  Web  pages  that 
have  been  erased. 

The  Alexa  company  has  made  a 
backup  copy  of  the  entire  World 
Wide  Web  That  doesn't  sit  well 
with  some  publishers  who  have 
taken  their  articles  off  Une  not  in- 
tending them  to  be  available  any 
more,  and  some  controversy  is 
brewing. 

Alexa  also  has  shortcuts  to  refer- 
ence material,  such  as  a  dictionary, 
thesaurus  and  encyclopedia. 

In  the  ratings  area,  it  will  tcU  you 
who  has  registered  the  domain 
how  often  the  site  has  been  visited, 
how  often  it  is  updated  and  how 
many  pages  are  on  the  site. 

You  can  download  a  copy  at 
www.aiexa.com. 

Here  are  this  month's  reviews  of 
Greater  Cleveland  Web  sites. 

Heather  Hill  Health 
&  Care  Center 

New  to  the  Web  Is  this  site  for 
Heather  Hill  Hospital.  Health  & 
Care  Center  in  Chardon.  a  non- 
profit facility  for  the  fVail  elderly 
and  physically  disabled. 

The  Web  site  has  interactive  fea- 
tures, such  as  moderated  dis 
cussion  forums  and  a  cool  virtual 
realuy  tour  of  the  facihties.  The 
tours  are  360-degTee  pans  around 
three  options,  the  interior  common 
area  of  the  Corlnne  Dolan  Alz 
hGimer  Center,  a  typical  resident 
room  or  the  outdoor  therapeutic 
park. 

By    using   your   mou.<^e   or   kov. 


The    virtual    tour    Is    definitely 
worth  a  spin. 

URI.:  vvww.heatherhiil.com 
VISUAL:  A 
WORTH  VISITING:  A 


Test.com 


Need  some  practice  on  taking 
test''  Test.com  is  one  Cleveland 
Web  site  that  wiU  help.  Test.com  is 
a  venture  capital  funded  site  that 
sells  career  planning,  employment 
and  other  testing.  Some  tests  are 
free,  and  for  others  there  is  a  fee. 

During  February  and  in  honor  of 
Valentine's  Day,  there  Is  a  Love 
Quotient  test  for  (ree.  It's  not  as 
easy  as  you  mleht  think  —  unless 
you  know  your  Shakespeare,  movie 
trivia  and  history. 

Some  other  sample  tests  are  of- 
fered. The  categories  range  tVom 
citizenship  to  typing  You  answer 
by  selecting  a  radio  button  or  by 
filling  in  the  blank.  Clickhig  on  an- 
other button  scores  the  test  and  you 
receive  the  correct  answers  You 
can  even  create  your  own  test. 

The  site  is  light  on  graphics,  but 
attractive. 

URL;  w%vw.test.coin 
VISUAJL:  B  i^ 
WORTH  VISITING:  B  ^ 


Women's  Hormone 
Center 

The  Women's  Hormone  Cente 
Beachwood  offers  answers  by 
mail  to  medical  concerns 

Other  information  files  are 
hne,  and  for  further  detaik,  a  b( 
is  available.  Excerpts  are  quoted 
line.  You  can  also  turn  to  the  pi 
of  links  and  find  a  dozen  other  si 
on  hormones  and  endocrinology. 

The  site  is  mostly  text. 
URL:  www.honnoiiecenter.coj 
VISUAL:  B. 
WORTH  VISITING:  B 


Denny  Carleton 


Denny  Carleton,  musician  and 
vocalist,  has  played  with  The  Lost 
Souls.  The  Choir.  Moses.  The  Pa- 
gans, Just  Guise,  and  other  rock 
bands  popular  in  the  1960s.  '703  and 
■80s  in  Greater  Cleveland  He  sUll 
plays  music,  but  now  in  the  coffee 
house  and  with  a  Christian  theme. 

He  recently  donated  music,  news- 
paper clippings  and  writings  to  the 
Western  Reserve  Histoncal  So- 
ciety If  you're  into  a  trip  down 
Memory  Lano  and  the  early  days  of 
rock  music  in  Cleveland,  read  the 
history   section  of  his   w«h   =.♦.. 


Lake  Erie  Quality 
Wine  Alliance 

Fourteen  Lake  Erie  wineries  a 
listed  on  this  Web  site,  and  a  tour 
ready  for  the  taking. 

Each  of  the  fourteen  has  either 
mil  Web  site  or  at  least  one  info 
maUon  page.  They  are  Buccia  Vin. 
yards,  Chalet  Debonne  Vmeyard 
Conneaut  Cellars  Winery,  Fir. 
lands  Wmery.  Klingshlrn  Winer: 
Lon^  Winery,  Markko  Vineyar* 
Mazza  Vineyards.  Mon  Ami  Re* 
taurant  and  Historic  Winery  01 
Firehouse  Winery.  Penn  Shor 
Vineyards,  Presque  Isle  Wine  Ce) 
lars  and  Woodbury  Vineyards. 

Their  locations  range  from  th. 
Lake  Erie  Islands  to  the  shores  o 
New  York  State.  Some  offer  bed  am 
breakfast  inns  and  entertainment 

Chalet  Debonne  Vlneyarda,  whilf 
a  Uttle  short  on  photos,  has  a  flnt 
Web  site  among  this  group. 
URL:  www.rinestock.com/ 
leqwa/index.htnil 
VISUAL:  B 
WORTH  VISITING:  B 

Goblin  Bee  Records 

Goblin  Bee  Records  is  a  cooper- 
ative 3az2  and  blues  record  label 
whose  performers  are  from  Greater 
Cleveland.  Eight  titles  have  been 
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What  It  Is 

Alexa  1 .3,  a  browser  aid 
which  reveals  information 
about  the  creator,  or  at  least 
the  host,  of  any  given  Web 
page,  related  sites,  and 
more. 


What  It  Does 

Ever  wonder  who  owns  the  page  you're  looking  at?  Or 
whether  the  information  is  any  good? 

While  Alexa  can't  check  facts  for  you,  the  software  does 
pop  up  a  slim  menu  bar  which  contains  information  about 
the  sites  a  user  visits,  and  acts  as  a  kind  of  site 
authenticator  and  navigational  aid. 

Alexa  puts  two  particularly  powerful  tools  in  the  hands  of 
its  users:  First,  the  company  has  collected  and  distilled  a 
great  deal  of  information  about  what  sites  people  visit 
before  and  after  they  visit  a  given  site.  These  "footpaths" 
of  previous  users  often  turn  up  lesser-known  but 
high-quality  and  useful  sites. 

Alexa  also  has  an  archive  feature.  So,  if  one  hits  a  page 
which  is  no  longer  connected  to  the  Web  -  the  infamous 
"404:  File  Not  Found"  error  -  there  is  a  good  chance 
Alexa  has  preserved  a  copy  of  the  page  in  its  memory 
banks. 

The  tool  makes  it  easier  to  evaluate  the  quantity, 
freshness,  and  popularity  of  the  sites  you  visit  by  noting 
the  number  of  pages  the  site  has,  when  they  were  last 
updated  on  average,  and  the  relative  amount  of  traffic 
that  site  receives. 

In  addition,  Alexa  features  site  ratings,  currently  from 
Yahoo!  Internet  Life  and  the  Britannica  Internet  Guide. 
Privacy  and  content  ratings  are  provided  by  online 
privacy  advocate  TRUSTe  and  content  labeling 
organization  RSACi. 

What  It  Means 

Alexa  posits  a  ftiture  in  which  users  know  the  source  of 
the  information  they're  looking  at  and  have  easy  access  to 
related  content. 
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It  also  envisions  a  community  of  millions  of  Alexa  users 
who,  with  their  individual  efforts,  help  to  map  and  rate 
the  Web  for  each  other. 

What  It  Costs 

The  software  can  be  downloaded  free  of  charge  at  Alexa's 
Web  site,  listed  below. 

Smile  For  The  Camera  » 
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Back  to  Spring  Internet  World 

Written  by  Sean  Wolfe  with  graphic  design  by  Dana  Byerly 


»  Alexa  Internet 


•  TRUSTe  member  directory 

•  Recreational  Software  Advisory  Council  on  the  Intemet 
fRSACil 

-  back  to  TOP 

f  Home  I  News  |  Weather  |  Sports  |  Life  |  Local  Guide  |  On  Air  |  CBS  com  1 
f  Feedback  I  Help  ] 

©  1998,  CBS  Worldwide  Inc.  &  CBS  Mass  Media  Corporation, 

All  Rights  Reserved. 

Please  click  here  for  more  copyright  information. 
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A  SICONO  LOOK:  18  IT  TIME  TO  PU^H  YET? 

Push  is  a  way  for  content  providers  to  send  you  updated  information. 
It  you've  downloaded  either  Microsoft  internet  Explorer  4.0  or 
Netscape  Navigator  4.0,  you've  probably  heard  about  these  efforts 
(known  as  Active  Desktop  and  Netcaster,  respectively)  For  those  of 
you  who  have  finally  upgraded  your  browsers,  is  it  time  to  push  yef 


EXPLORER  4.0 

When  you  download  4  0,  you  li 
be  asked  if  you  want  a  "Web 
Integrated  Desktop"  By  click- 
ing "yes"  (we  don't  recommend 
this),  your  entire  desktop  will 
behave  as  if  it  were  a  browser 
Click  on  the  Channel  button  on 
your  browser  to  open  a  selec- 
tion of  push  content  sites 


NAVIGATOR  4.0 

To  use  Netscape's  push  technol- 
ogy, you  need  to  download  the 
full  Communicator  4  03  suite  to 
get  Netcaster  (wwwnetscape 
com).  Open  Navigator,  then  click 
on  Communicator"  at  the  top  of 
the  browser  Scroll  down  to 
"Netcaster "  This  will  open  a  sep- 
arate "browser " 


SHOULD  YOU  BOTHER? 

The  answer  is  still  "no."  The  concept  is  interesting  and  the  results 
are  pretty,  but  ordinary  Web  sites  offer  the  same  content,  and  faster 
Active  Desktop  is  quicker  than  Netcaster,  but  the  integrated  desktop 
IS  a  nuisance.  T1  users  will  have  an  easier  time  with  push,  but  those 
with  access  at  28  8Kbps  or  slower  should  simply  forget  about  it 


AND    DOWNS 
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"Push  inh'tnlo^Ki  NeruLijer 
trncl  Active  Dr^hop  nick  up 
niajur  Jr.i:  spiwe. " 
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Alexa  (www  aiexa  com) 

The  ticc  Weh  navigarioa  add-on  provides  users  vvith  a  vanety 
ot  cool  services,  <ajch  ui  link  suggesaons  (for  finding  new  sites 
based  on  your  surfing  preferences),  quick-click  access  to 
Bntaiuiic.i  Online  and  Mcmitm-Webiter  Durivnaty  and 
Thfsn-'crM  reference  rouls,  an  archive  of  iiTem<rvable  404  Nor 
I  ound  pages  (for  locadng  p.iges  rhai  have  niystenoiisly  disap- 

j   pearcd  from  the  Web  world),  and  the  capability  to  send  and  te- 

I  ceive  instant  inessajjcs. 

'      .\lexa  .ilso  provides  site  sradscics  for  c-vcry  Web  page  you 

I  \ibit  (^nd,  incideiually,  links  to  available  V-Lsfi  site  reviews). 
Find  out  who  owns  the  sue  you're  visinnjj,  how  many  pages 
Jeep  It  goes,  how  frcc]uendy  it 
is  updatcti  by  its  master,  and 
how  p>i')pular  it  is  among  other 
AJexa  users  The  applicaoon 
lies  across  the  bottom  of  your 
browser  as  a  separate  tool  bar 
and  requires  }  MB  of  hard 
disk  space.  Yov  must  be  run- 
ning Windows  9S  or  Nl"  and 
h.ive  Mcrseape  Nnngaror  2.0 
or  later — or  Microsoft 
Internet  Explorer  3,0  or  lat- 
er— to  run  the  program 


Search  Upgrade 

Try  out  Lycos's  new  search 
features  at  i*vvw./ycos. 
com/search.  There,  you  car 
seek  out  words  and  topics 
that  appear  within  Web  page 
titles.  Web  addresses,  or 
specific  Web  sites  instead  of 
querying  the  entire  NbI, 


For     Y  ■  L  I  f I '  z    oaily    duo    o^    oownloads.     visit      www.ooubledowmiooo.co 
MAPt  ipsa     vahoo'  internet  lifg 
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INFOMARKET 


Web  navigation  Utility 


llexa'sfreelliYigator 
deftly  searches  for  Well  sites 


By  Howard  Millman 

IF  you're  tired  of  sifting 
through  the  infoirash  increas- 
ingly returned  by  standard  Inter- 
net search  engines,  you  may 
want  to  try Alexa  Internets  free 
browser  add-on.  Alexa  1.4. This 
product  can  point  you  to  Web  iites 
with  quality  information  and  then 
suggest  other  sites  that  may  offer 
similar  or  additional  data.  In  my 
trills.  Alcxa  made  about  an  equal 
nuinber  of  worthwhile  and  worth- 
less reconurendaiions,  which  siill 
put  mi  ahead  of  the  game. 

For  example,!  wanted  to  purchase 
a  35mm  camera.  I  visited  http:// 
www.allcatriera.com,  a  site  that 
s«lls  new  and  used  cameras.  Alexa 
dutifully  .'suggested  other  photo- 
graphy sites  to  visit  next.  However, 
because  Alexa  did  noi  know  that 


I  wanted  lo  purchase  a  camera,  it 
Suggested  sites  that  gcncrically  relat- 
ed to  photography  but  not  camera 
sJes.  That's  understandablebecause 
Alexa,  and  its  primary  competitors 
Wise  Wire  and  Firefly,  could  inter- 
pret my  actions  but  not  my  purpose. 
Alexa  partially  bases  its  conclu- 
sions about  3  site's  content  Ort  its 
reading  of  the  metj  tags  in  a  site's 
header.  Consequently,  its  sugges- 
tions' accuracy  relate  to  the  honesty 
and  competence  of  the  site's  owners. 

Crawling  ttie  Web 

Alexia  knowledge  of  a  site's  content 
results  from  Alexa  Internet's  ambi- 
tious program  to  crawl  and  catalog 
the  Web.  This  mind-numbing  oper- 
ation offers  a  valuable  side  benefit  to 
Alexa  users.  If  a  Web  site  or  a  page 
you  want  no  longer  exists,  the  tool 
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AUXA'STOOLBAR  sirs  ai  the  bottom  of  your 
browser.  When  clicked,  ilpops  upsurges- 
tions  on  where  lo  go  next,  as  well  as 
site  statistics. 


can  deliver  it  up  from  its  archive, 
providing  it  has  crawled  that  Site. 
When  I  tried  to  resurrect  a  site.  1  had 
to  wait  about  15  minutes  as  Alexa 
ground  through  its  tape  archives, 
but  the  product  delivered.  To  date. 
Alexas  voracious  spider  has  crawled 
more  than  650,000  Web  sites. 

If  Alexa's  spider  has  not  crawled 
a  site,  then  diat  site's  ratings  and  con- 
tent analyses  depend  solely  on  users 
who  have  visited  and  evaluated  the 
.■iite.  A  site  owner  can  request  that 
Alexa  indudcorexcludt-asite  by  fil- 
ing a  request  at  Alexa's  Web  site. 

As  well  as  navigating  and  making 


content  recommenda- 
tions, Alexa  reveals  other 
site  information.  For  ex- 
ample, a  composite  rnting 
indicate.'t  how  other  visi- 
tors lUvcd  tlie  site  and  who 
the  site  owner  is.  and  pro- 
vides statistics  asse.':sing  its 
orverall  performance  and 
the  freshness  of  its  content. 
You  can  add  your  opinion 
of  (he  site  by  clicking  on  a 
burton 

Alexa  suffers  from  some 
technical  limitations.  For 

example,  ififs  used  behind 

a  corporate  firewall,  proxy  settings 
must  be  recon&gured  to  let  its  data 
esoJianges  continue  unimpeded. 

Unobtnisive  presence 

Alexa  limits  its  presence  on  your 
desktop  to  an  unobtrusive  toolbar. 
The  toolbar  also  carries  ads,  which 
pay  the  freight,  and  free  links  to 
Standard  reference  matcritJs  such  as 
the  Encyclopedia  Britannica  and  the 
Merriam-  Wefcter  dictionary.  A  chat 
feature  lets  you  conference  online 
with  Alexa-using  colleagues. 

I  recommend  that  you  use  Alexa 
to  supplement,  not  supplant,  a  tra- 
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This  iiBlpieWeb  navigator  slD  on  the  \,<n- 
lom  of  your  btewur.  h  jeclis  tD  litiprow  on 
traditional  searth  engine  by  cnmbininj  a 
collaborative  filtering  technique  and  grad- 
ing system  with  a  seareh  engine. 

■  Pros:  Free  easy  to  use. 

•  Cons:  Suggestions  vati«i  from  a  per- 
leatTBtt^itotflttllyiffelwanctCTdsto 
ovtiloeksTnaller  sites. 

(m)882'5392(t!iil-f!te}.mj 
56H$00:(wc(4151Sil-e?9S; 
hnj)J/www.ele»e.com, 
-  Woerfe 

■  Platfum:Wln(lim95.m(loi»imiJ). 

ditional  senrch  engine.But  as  Aiexa 
divines  your  research  goals,  it  will 
minimize  the  need  to  sift  through 
the  vast  amount  of  cxtraneot«  data. 


HowardMUlman  (ktmllman&ibm 
.net)  operates  the  Data  System  Sir- 
vice  Croup,  in  Croion,  N.  Y, 
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Your  Cyberspace  Trail  Guide 

A  software  start-up  profits  by  archiving  ttie  Web's  most  popular  pathways. 


I 


F  YOU  VE  EVER  FELT  LOST  ON 

the  World  Wide  Web,  a  pioneering  Inter- 
net scientist  has  designed  a  new  software 
product  for  you.  Just  as  hikers  benefit  from 
old  trekking  paths,  Web  surfers  using  Alexa 
Internet  can  follow  the  trails  of  thousands  of 
browsers  who  preceded  them. 

Alexa  is  a  navigator  that  works  in  tandem 
with  Netscape  or  Internet  Explorer.  (Paid 
for  by  advertisers,  it  is  free  of  charge  to  users.) 
"I  think  of  it  as  a  co-pilot  for  wandering 
around  cyberspace,"  says  Jerry  Michalski,  a 
technology  analyst  at  the  newsletter  Release 
i.o,  who  predicts  it  will 
be  a  hit. 

Insiders  call  Alexa's 
creator,  Brewster 
Kahle,  37,  a  brilliant 
visionary.  "Everything 
he  touches  turns  to 
gold,"  says  Joella 
Paquette,  marketing 
development  manager 
at  Digital  Equipment 
Corp.'s  Network  Systems  Laboratory. 
Designing  supercomputers  shortly  after  grad- 
uating from  Massachusetts  Institute  of  Tech- 
nology, Kahle  then  created  some  of  the 
earliest  Internet  publishing  tools  and  pro- 
duction services  that  enabled  the  government 
and  private  firms  to  get  on  the  Net.  America 
Online  bought  him  out  in  1995. 

Alexa,  his  latest  brainstorm,  can: 
■  Locate  extinguished  links  whenever  you  hit 


®  Alexa 

Navigate 

the  web 

smarter 
faster 
easier 


a  dead-end  on  the  Web,  retrieving  and  dis- 
playing Web  pages  that  no  longer  exist; 

■  Suggest  where  to  go  next  on  the  Web,  based 
on  the  records  of  previous  visitors; 

■  Rank  the  site  you're  visiting  and  display 
information  about  its  owner  and  contents. 

Alexa  also  donates  everything  it  finds  to 
a  not-for-profit  information  warehouse 
Kahle  recently  founded.  The  Seattle-based 
Internet  Archive  is  a  digital  library  that's 
amassing  every  publicly  accessible  Web  page, 
newsgroup  posting,  and  file  that's  ever  flashed 
into  existence.  Buried  in  this  mountain 
of  data  are  the  path- 
ways trod  by  millions  of 
Internet  users,  which 
provide  valuable  clues 
to  how  people  use  the 
Net,  part  of  Kahle's  his- 
torical mission. 

Because  the  life 
span  of  the  average 
Web  page  is  just  75 
days,  our  "digital  her- 
itage" is  disappearing  fast,  Kahle  explains. 
"The  archive  will  be  seen  a  century  from  now 
as  an  unprecedented  collection  of  the  human 
voice."  It's  already  eight  terabytes  big — that's 
one  trilhon  bytes — and  growing. 

Download  Alexa  at  www.alexa.com.  For 
now,  you  must  run  Windows  95  or  Windows 
NT  4.0.  A  Macintosh-compatible  version  is 
in  the  works. 

— Peter  Zheutlin 
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Alternative  doorways  to  the  Internet  are  popping 
up  in  the  spirit  of  free-flowing  information. 


THERE  were  only  a  few  snick- 
ers last  week  at  a  new-media 
conference  in  Chicago  when 
someone  asked  a  group  of  executives 
from  Web-search  companies  which 
of  them  would  be  the  first  to  buy  a 
major  television  network.  It  was  a 
sign  of  just  how  quickly  the  likes  of 
Yahoo,  Lycos,  Excite  and  Infoseek 
have  grown,  how  grand  their  mass 
media  dreams  have  become  and  how 
much  their  original  vision  of  the 
World  Wide  Web  has  changed. 

No  longer  content  to  direct  a  user 
to  the  Web  sites  most  relevant  to  the 
individual's  interests  and  tastes,  the 
former  search  services  are  intent  on 
creating  "portals"  to  the  Internet, 
complete  with  dial-m  service,  free  E- 
mail  and  personal  Web  pages,  paging 
and  messaging,  customized  news,  lo- 
cal weather  and  stock  tracking.  The 
aim :  get  the  suckers  under  the  tent 
and  keep  them  there.  That's  a  radi- 
cal departure  from  the  original  busi- 
ness model  of  catching  the  custom- 
er's eye  on  the  way  to  other  sites. 

The  original  search  model  was 
deeply  steeped  in  the  Internet  ethos 
—  the  idea  that  information  and  traf- 
fic flows  freely  from  site  to  site  and 
value  is  added  by  serving  that  urge, 
not  resisting  it.  It  is  an  ethos  that  fa- 
vors entropy  over  organization,  an 
open  range  of  Web  publishing  com- 
pared with  the  stockyard  chutes  of 
the  portal  sites.  And  it  is  an  ethos 
that  for  many  in  the  Web  busmess 
still  endures  —  despite  the  portal  fad. 

"There  are  between  5  and  10  mil- 
lion content  areas  on  the  Web,  and  in 
many  ways  the  Internet  is  more  like 
print  publishing,"  observed  Brew- 
ster Kahle. 

Mr.  Kahle  is  an  Internet  pioneer 
whose  1989  invention,  the  Wide  Area 
Information  Server  —  or  WAIS  — 
was  a  pre-Web  system  for  searching 
distant  data  bases  on  the  Internet. 
He  later  sold  his  on-line  and  software 
publishing  company,  WAIS  Inc.,  to 
America  Online. 

The  main  portals  are  following  a 
model  that  mimics  the  control  and 
distribution  of  cable  television  net- 
works. But  Mr.  Kahle  thinks  that  is 
the  wrong  model.  "There  are  16,000 
journal  publishers  in  print,  real  di- 
versity. And  everyone's  experience 
is  different.  Do  we  need  a  TV  Guide? 
I  think  we  need  something  a  little 
more  sophisticated." 

Certainly  the  current  search  serv- 
ices are  far  from  exhaustive.  A  study 
released  Inst  month  by  the  NEC  Re- 


search Institute  of  Princeton,  N.  J., 
indicated  that  even  the  most  thor- 
ough service,  Hotbot  (www.hotbot- 
. com),  has  indexed  only  34  percent  of 
the  Web's  estimated  320  million 
pages. 

But  Mr.  Kahle  is  hardly  a  disinter- 
ested observer.  He  is  the  president 
and  co-founder  with  Bruce  Gilliat  of 
Alexa  Internet,  a  two-year-old  start- 
up company  based  in  San  Francisco. 
Their  product  is  Alexa,  part  Web 
browser,  part  navigation  service.  Us- 
ers download  the  software  from 
www.alexa.com  free,  after  which 
Alexa  manifests  itself  as  a  thin  tool- 
bar under  the  regular  Web  browser 
—  whether  the  PC  is  using  the  Net- 
scape Navigator  or  Internet  Explor- 
er from  Microsoft. 

Alexa  offers  quick  access  to  infor- 
mation on  each  site  visited  (who 
owns  it,  how  much  traffic  it  gets,  and 
how  Alexa  users  rated  its  content), 
provides  links  to  other  similar  sites 
and,  lately,  tiny  advertising  mes- 
sages keyed  to  the  user's  browsing 
selections.  About  350,000  copies  of  the 
program  have  been  downloaded  and 
there  are  100,000  regular  users,  ac- 
cording to  Mr.  Kahle. 
• 

Alexa  is  a  geeky  end-run  around 
the  sleek  mass  media  dreams  of  the 
search  engine  companies.  While  Ex- 
cite, Yahoo,  Lycos  and  Infoseek  are 
adding  as  many  features  as  possible 
to  keep  users  on  their  sites  for  as 
long  as  possible,  Alexa  encourages 
wide  and  frequent  grazing  by  recom- 
mending sites  wherever  the  user's 
interests  may  lead,  based  on  the  per- 
son's past  preferences  —  and  based 
on  the  preferences  of  other  users 
who  have  frequented  the  same  sites. 

Mr.  Kahle  calls  this  approach 
"contextual  navigation."  The  more 
users  in  Alexa's  data  base,  the  better 
the  similar-preferences  software 
works  —  and  the  more  precisely  that 
Alexa  can  tailor  its  ads  to  individual 
users.  In  other  words,  the  more  that 
users  surf  outside  the  main  portals, 
the  better  Alexa's  revenue  stream. 

Alexa  is  named  for  the  library  of 
Alexandria,  the  ill-fated  attempt  of 
the  ancient  Greeks  to  amass  all  of 
the  literate  world's  printed  knowl- 
edge. And  in  keeping  with  this  ideal, 
Alexa  brings  the  emphasis  in  Web 
navigation  back  to  content  —  not  just 
a  reader's  digest  of  the  Web. 

And  yet,  in  choosing  to  name  his 
venture  after  an  ambitious  idea  that 
ultimately  fell  short,  Mr.  Kahle  is  im- 


plicitly conceding  that  the  sheer  size, 
growth,  and  second-to-second  muta- 
bility of  the  Internet  makes  it  almost 
impossible  to  amass  the  collected 
works  of  the  Web.  The  reason  the 
current  search  services  consistently 
rank  among  the  most  popular  Web 
sites  is  that  many  people  presum- 
ably do  want  some  winnowing. 

But  the  mass-market  model  need 
not  be  the  only  portal  approach.  "We 
believe  that  people  are  more  in  need 
of  filters  than  catch-alls,"  said  Rufus 
Griscom,  co-founder  of  the  artily 
erotic  on-line  magazine  Nerve 
(www.nervemag.com).  That  is  why 
Nerve,  which  bills  itself  as  "literate 
smut,"  has  created  its  own,  more 
narrow  portal :  a  directory  of  sexual- 
ly oriented  Websites. 

And  Nerve  it  is  not  alone  in  provid- 
ing a  narrower  doorway  to  the  Web. 
Alternative  portals  are  everywhere, 
including  Razorfish's  "Disinforma- 
tion" search  engine  (www.disinfo- 
.com)  that  provides  links  to  various 
subculture  sites,  and  "John  Skilton's 
Baseball  Links"  (www. baseball- 
links. com),  perhaps  the  most  com- 
plete guide  to  baseball  on  the  Web. 

These  alternate  portals  are  evi- 
dence of  the  Internet  ethos  that  re- 
fuses to  conform  to  a  mass  media 
structure.  In  their  race  to  emulate 
mainstream  media  giants  like  Time 
Warner,  CBS  and  Disney,  the  search 
engines  may  be  forgetting  the  very 
phenomenon  that  brought  them  into 
being:  the  Internet  is  a  medium  of 
creators  as  much  as  it  is  a  medium 
for  consumers. 

Tom  Watson  is  editor  and  co- 
founder  of(a'NY  (www.atnewyork- 
.com),  an  information  service  Ikat 
focuses  on  New  York's  interactive 
industrv. 
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